I saw the following claim in these lecture notes: http://www.cs.cmu.edu/~ninamf/ML11/lect0906.pdf
Intuitively, if n is large but most features are irrelevant (i.e., the target is sparse but the examples are dense), then Winnow is better, because adding irrelevant features increases L2(X) but not L∞(X). On the other hand, if the target is dense and the examples are sparse, then Perceptron is better.
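For context, the mistake bounds being compared are, as I understand the slides (roughly, suppressing the margin terms):

$$
\text{Perceptron: } O\big( (L_2(w^*)\, L_2(X))^2 \big), \qquad
\text{Winnow: } O\big( (L_1(w^*)\, L_\infty(X))^2 \log n \big),
$$

where $L_2(X) = \max_x \|x\|_2$ and $L_\infty(X) = \max_x \|x\|_\infty$ are taken over the examples.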
Why does adding irrelevant features increase L2(X) but not L∞(X)?
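To make the question concrete, here is a minimal numeric sketch of the claim, assuming boolean ({0,1}) features; the dimensions k and m below are made up:

```python
import numpy as np

# Dense boolean example with k = 100 relevant features, all active.
k = 100
x = np.ones(k)

print(np.linalg.norm(x, 2))       # L2 norm: sqrt(100) = 10.0
print(np.linalg.norm(x, np.inf))  # L-infinity norm: 1.0

# Append m = 900 irrelevant boolean features (also active, worst case).
m = 900
x_padded = np.concatenate([x, np.ones(m)])

print(np.linalg.norm(x_padded, 2))       # sqrt(1000) ~ 31.6 -- grows with n
print(np.linalg.norm(x_padded, np.inf))  # still 1.0 -- bounded features cap it
```

So with features bounded in [0, 1], L∞(X) can never exceed 1 no matter how many coordinates are added, while L2(X) grows like the square root of the number of active coordinates. Is this bounded-feature argument the intended reasoning, or is there more to it?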