elfs: (Default)
[personal profile] elfs
Last night I realized that, given you have two vectors in a 3-dimensional space starting from the same point that are not the same vector, that those two vectors create a plane-- then the same is true for any n-dimensional space. 3 dimensionas, 10 dimensions, 40 dimensions, it doesn't matter.

Why is this important? Simple; once you've identified the plane, you can find the angle between the two vectors.

If can get two people to answer 100 yes or no questions, you can create two vectors in a 100-d space and find the angle between those vectors. Those people with the smallest angle will have the greatest similarities in response. This is the basis of a vast number of recommendation engines.

It was the n-dimensionality that was bugging me. I finally grasped how little that matters in the end.

Date: 2010-03-17 05:18 pm (UTC)
From: [identity profile] en-ki.livejournal.com
Relatedly, the correlation of two variables across n samples is the dot product of the two n-dimensional vectors (once they are recentered to have a mean of zero and scaled by the standard deviation).

Geometrizing statistics is fun:

http://en.wikipedia.org/wiki/Principal_component_analysis
http://www.plosgenetics.org/article/info:doi/10.1371/journal.pgen.0020190

Profile

elfs: (Default)
Elf Sternberg

December 2025

S M T W T F S
 12345 6
78910111213
14151617181920
21222324252627
28293031   

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Jan. 1st, 2026 11:04 am
Powered by Dreamwidth Studios