I've seen this question and answer: Why does Newton's method work?
It gives some geometric intuition as to what is going on when applying Newton's method, but what I really need to know is why it works that way.
How can you prove that each approximation is closer to the correct value than the previous one?
And also, if the following is indeed true, how can you prove that the "gap" between each two subsequent approximations gets smaller and smaller as you keep going up in the sequence of approximations?