I am currently working on line search algorithms and read about Armijo's Rule. The rule is given for a fixed $\beta \in (0,1)$ and $\eta >1$ we choose $\alpha \geq 0$ such that
$$ f(x+\alpha\cdot d) \leq f(x) + \beta\cdot \nabla f(x)^t\cdot d\cdot \alpha, \tag{$*$}$$ in order for $\alpha$ not being to big. And to ensure that $\alpha$ is not to small we want $(\eta\alpha)$ to not fulfill $(*)$. I have two questions regarding this algorithm:
- In the case that $f \in \mathcal C^1(\mathbb R, \mathbb R)$ do we still consider $f'(x)\cdot d$ or just $f'(x)$ in $(*)$? Because the book states the inequality without the direction variable $d$. I guess it might be because of rescaling $\alpha$ and only having 1-dimensional direction it should not be different.
- When implementing backtracking the given algorithm does not check whether or not $\alpha$ is too small i.e. if $(\eta\alpha)$ does not satisfy $(*)$. Why is that? As far as I understood since $d$ is aleardy a descent direction $\alpha$ being too small only affects the rate of convergence but not the algorithm per se. Is that true and, thus, we do not need to check the second property?