19

Two vectors $v,w \in V$ in a vector space $V$ with a scalar product $\langle \cdot,\cdot\rangle $ (we assume that $\langle v,w\rangle =\langle w,v\rangle $) are defined to be orthogonal if the scalar product $\langle v,w\rangle =0$. Lets suppose we have a normed vectorspace where the norm is defined with our scalar product in the following way

$$ \|x\|:= \sqrt{\langle x,x\rangle } \label{1}\tag{1}$$ It is easy to prove that $$\|v-w\|^2=\|v\|^2+\|w\|^ 2 \iff \langle v,w\rangle =0 \label{2}\tag{2}$$ if the norm is defined like in (\ref{1}).

To prove it in the one direction:

$"\Rightarrow"$ we start with $$\|v-w\|^2=\|v\|^2+\|w\|^ 2$$ lets use (\ref{1}) and substitute the norm $$ \langle v-w,v-w\rangle =\langle v,v\rangle +\langle w,w\rangle $$ $$ \langle v,v-w\rangle -\langle w,v-w\rangle =\langle v,v\rangle +\langle w,w\rangle $$ $$ \langle v,v\rangle -\langle v,w\rangle -(-\langle w,w\rangle +\langle w,v\rangle )=\langle v,v\rangle +\langle w,w\rangle $$ $$ \langle v,v\rangle -2\langle v,w\rangle +\langle w,w\rangle = \langle v,v\rangle +\langle w,w\rangle $$ when we substract $\langle v,v\rangle $ and $\langle w,w\rangle $ from both side and divide by $-2$ there is left $$\langle v,w\rangle =0$$

$"\Leftarrow"$ the other way of the equivalence we prove it by contradiction suppose: $$\|v-w\|^2 \neq \|v\|^2+\|w\|^ 2-2\langle v,w\rangle $$ Then lets substitute the norm with the scalar product we get: $$\langle v,v\rangle +\langle w,w\rangle -2\langle v,w\rangle \neq \langle v,v\rangle +\langle w,w\rangle -2\langle v,w\rangle $$ so this is wrong so $$\|v-w\|^2 = \|v\|^2+\|w\|^ 2-2\langle v,w\rangle $$ is true. Since we supposed that also $\langle v,w\rangle =0$ is true then also $$\|v-w\|^2=\|v\|^2+\|w\|^2$$ is true. So the the equivalence (\ref{2}) is proven.

My question is now why use scalar product to define orthogonality? A vector space not necessarily needs a scalar product for orthogonality. One could also define orthogonality by saying two vectors are considered to be orthogonal if the equation $\|v-w\|^2=\|v\|^2+\|w\|^ 2$ holds.

We could define the concept orthogonality even more general for a metric space $(X,d)$.

With $x,y \in X$ while $X$ should be at least an unital magma with $0$ as the neutral element. So we would say $x,y$ are orthogonal when $$d(x,y)^2=d(0,x)^2+d(0,y)^2$$ Or is there a general problem with it?

oggiman
  • 447
  • @DietrichBurde actually my question is: Could we define orthogonality even if we do not have a bilinear form? Do we need it really if we already have a norm or can we define orthogonality without bilinear form when we have atleast a norm? A norm not necesarly need a bilinear form or scalar product to be defined. – oggiman Jan 10 '25 at 19:13
  • Of course we can define orthogonality in a different way. See wikipedia for different, but equivalent definitions (perpendicular, or dot product equal to zero etc.). But the most general definition is by $B(v,w)=0$ for a bilinear form for a vector space. Then not even a metric is required. I don't see a reason why we should not define orthogonality this way. – Dietrich Burde Jan 10 '25 at 19:22
  • See also this post for the proof you did. – Dietrich Burde Jan 10 '25 at 19:34
  • 4
    The word "orthogonal" literally means right-angled. So it seems to me that we're talking about angles, not just lengths. – Ted Shifrin Jan 10 '25 at 19:56
  • 4
    You should check out how your proposed definition of orthogonality works out in the $\sup$ norm. – Rob Arthan Jan 10 '25 at 22:13
  • Interestingly, your orthogonality criterion corresponds to Pythagoras' theorem. I.e. "two vectors are ortho if the triangle they form obeys Pythagoras' theorem". – Al.G. Jan 11 '25 at 11:40

2 Answers2

36

(Edit: I have changed the norm from $1$-norm to $\infty$-norm, as suggested by Rob Arthan in a comment to the question.)

If your norm is not induced from an inner product, or more generally, if orthogonality is not defined via some bilinear/sesquilinear form, one issue it may raise is that orthogonality may no longer play along with linearity. E.g. two orthogonal vectors may not remain orthogonal if you scale one of them. To illustrate, consider $$ v=\pmatrix{3\\ 1},\quad w=\pmatrix{0\\ -4},\quad v-w=\pmatrix{3\\ 5},\quad 2v-w=\pmatrix{6\\ 6}. $$ In your definition, $v$ is orthogonal to $w$ with respect to the supremum norm: $$ \|v-w\|_\infty^2=5^2=3^2+4^2=\|v\|_\infty^2+\|w\|_\infty^2, $$ but $2v$ is not orthogonal to $w$: $$ \|2v-w\|_\infty^2=6^2\ne 6^2+4^2=\|2v\|_\infty^2+\|w\|_\infty^2. $$

user1551
  • 149,263
6

Basically the answer is that if you have a norm that can be used to define orthogonality, it can also be used to define a dot product.

The definition for orthogonality you derived was $\|\bar v-\bar w\|=\|\bar v\|^2+\|\bar w\|^2$. You could similarly and equivalently derive $\|\bar v+\bar w\|=\|\bar v\|^2+\|\bar w\|^2$ which I'll be using. If this works as a definition for orthogonality it should also hold if we scale the lengths of these vectors by scalars, i.e. $\|a\bar v+b\bar w\|=a^2\|\bar v\|^2+b^2\|\bar w\|^2$, because just scaling lenghts of the vectors doesn't change the angle between them. Let's take take two arbitrary vectors $\bar x$ and $\bar y$ which are in the plane spanned by the orthogonal vectors $\bar v$ and $\bar w$. Thus we have $\bar x=a\bar v+b\bar w$ and $\bar y=c\bar v+d\bar w$ for some scalars $a$, $b$, $c$, and $d$. From this we can derive

$$\|\bar x\|^2=a^2\|\bar v\|^2+b^2\|\bar w\|^2,$$ $$\|\bar y\|^2=c^2\|\bar v\|^2+d^2\|\bar w\|^2.$$

We can also derive

\begin{align} \|\bar x+\bar y\|^2 &=\|(a+c)\bar v+(b+d)\bar w\| \\ &=(a+b)^2\|\bar v\|^2+(b+d)^2\|\bar w\|^2 \\ &=(a^2+c^2+2ab)\|\bar v\|^2+(b^2+d^2+2cd)\|\bar w\|^2 \\ &=\|\bar x\|^2+\|\bar y\|^2+2(ab\|\bar v\|^2+cd\|\bar w\|^2) \end{align}

and

\begin{align} \|\bar x-\bar y\|^2 &=\|(a-c)\bar v+(b-d)\bar w\|=(a-b)^2\|\bar v\|^2+(b-d)^2\|\bar w\|^2 \\ &=(a^2+c^2-2ab)\|\bar v\|^2+(b^2+d^2-2cd)\|\bar w\|^2 \\ &=\|\bar x\|^2+\|\bar y\|^2-2(ab\|\bar v\|^2+cd\|\bar w\|^2). \end{align}

From these we get

$$\|\bar x+\bar y\|^2+\|\bar x-\bar y\|=2(\|\bar x\|^2+\|\bar y\|^2).$$

This is called the parallelogram law and here we derived that for this kind of norm it must hold for all vectors (or at least those in the plane spanned by these orthogonal vectors). But if the norm satisfies this law we can easily prove that we define a dot product using the norm in one of the following ways:

\begin{align} \langle\bar v,\bar w\rangle &=\tfrac{1}{2}(\|\bar x\|^2+\|\bar y\|^2+\|\bar x+\bar y\|^2) \\ &=\tfrac{1}{2}(\|\bar x\|^2+\|\bar y\|^2-\|\bar x-\bar y\|^2) \\ &=\tfrac{1}{4}(\|\bar x+\bar y\|^2+\|\bar x+\bar y\|^2). \end{align}

You can reasonably easily check that a dot product defined this way satisfies all the properties required of a dot product when the norm satisfies the parallelogram law.

Here I'm assuming the the field of scalars is the real numbers. In case of complex scalars you can define an inner product using a norm in a similar though slightly more complicated way.

  • 15
    Sorry but no, not every norm yields an inner product: the norm has to satisfy the parallelogram law, a criterion due to Fréchet, von Neumann, and P. Jordan. Furthermore, | gives nice norms $| \cdot|$ ... – Hanno Jan 10 '25 at 20:11
  • 2
    Hanno is right, check this post for more details :https://math.stackexchange.com/questions/3124666/can-we-define-an-inner-product-in-terms-of-the-norm-induced-by-it – Hrodelbert Jan 10 '25 at 20:12
  • 1
    @Hanno I guess I should add that criterion to my answer. But I'm assuming that defining orthogonality similarly only works if the norm satisfies the parallelogram law. Am I correct? – QuantumWiz Jan 10 '25 at 20:17
  • Did you look up the linked post pointed to by Hrodelbert, in particular the first comment to the answer? Or you may read https://math.stackexchange.com/a/2174427/316749 – Hanno Jan 10 '25 at 21:26
  • @Hanno I did read it though not the comments (until now). In any case, it doesn't seem to directly answer the question I asked in my previous comment. – QuantumWiz Jan 10 '25 at 22:13
  • @Hanno In case it wasn't clear, my question in my comment was whether I'm right about that defining orthogonality with a norm works when the norm satisfies the parallelogram law (i.e. exactly when you can define a dot product using the norm). – QuantumWiz Jan 10 '25 at 22:43
  • 3
    Yes; defining orthogonality (in the usual way via dot/inner product) with a norm works if and only if the norm satisfies the parallelogram law. Which repeats the comment alluded to above. – Hanno Jan 11 '25 at 09:42
  • @Hanno OK, I updated my answer. Do you think it's a good answer now? Feel free to suggest further improvements. – QuantumWiz Jan 11 '25 at 18:05
  • @Hrodelbert Do you think my updated answer is a good one? – QuantumWiz Jan 11 '25 at 18:06
  • It can be clearer (streamline the logic, use $\vec{x}$ instead of $\overline{x}$, etc.), but it does illustrate the point we have been discussing here in the comments. I think a clearly written expose could be a useful answer to this question. Note that the downvoters will probably not come back to revisit their vote after your edits, though. Given that it is substantially different to your original, I think posting it as a new answer could be justified. – Hrodelbert Jan 12 '25 at 11:07
  • @Hrodelbert Is deleting and reposting an answer to get rid of downvotes allowed? It kind of feels like cheating, even if you could argue that it's justified in this case. Maybe I'll ask in the Meta about it. – QuantumWiz Jan 12 '25 at 14:29
  • @Hrodelbert I asked about this in the Meta and got the answer that deleting and reposting is not allowed. – QuantumWiz Jan 12 '25 at 15:56
  • You can revert this to the original answer and delete it, then posting a different answer is allowed. I have not read through the answer, its edit history, or these comments to determine if your new answer is actually different or worth posting at all, though. My first impression glancing was that it was an unnecessary diversion. – coiso Jan 12 '25 at 16:07
  • @coiso The answer I got on Meta was pretty categorical. I don't think I'll risk potentially breaking rules. – QuantumWiz Jan 12 '25 at 16:11
  • 1
    Although I agree that posting a different answer should be allowed, a better solution, I think, would have been to revise the defective answer, at the same time placing a notification at the top of the post acknowledging the error and stating the the current answer is a complete rewrite. That might make it more likely that users who downvoted will notice and reverse their downvotes. At this point, I think coiso's suggestion is no longer viable, given that the revised post has been upvoted a few times. At least it's no longer grayed out. – Will Orrick Jan 13 '25 at 04:00