Intuitively, the "hidden" dimensions are still coupled to the "visible" dimension, their values influence where it goes next, how "curved" the graph of the visible component is etc. This deterministic influence allows to reconstruct the hidden part of the dynamic from the time series of the visible component.
For a general discussion of the theory and how Takens theorem applies to invariant sets of fractal dimension, see scholarpedia: Attractor reconstruction. A condensed version focused on the Lorenz system can be found in Understanding Takens' Embedding theorem.
As to the Lorenz system specifically I discussed how $y$ and z can be more-or-less reconstructed from $x$ with 2 time delays in Why Lorenz attractor can be embedded by a 3-step time delay map?. This obviously is much less than the embedding dimension 7 of the original Takens theorem and 5 of the fractal version.
In Lorenz Equations, embedding and Takens’ theorem was discussed why the reconstruction is not possible starting from the $z$ values due to the symmetry $(x,y,z)\to (-x,-y,z)$ of the system.
Another way the theorem can "fail" is if you take $2$ independent copies of the Lorenz system with different initial values. Then the $x$ component of the first system will contain no information of the second system. But actually, that this situation is to be excluded in the theorem is included in the "generic" assumption, in that a thin set of signals may have to be excluded from the statement. Conversely, any generic linear combination of both $x$ components should be a valid signal for the combined system. As the combined box-counting dimension should be slightly larger than $4$, an embedding dimension of $9$ (or possibly smaller) would be sufficient.