Data Science Decoded

Data Science #10 - The original principal component analysis (PCA) paper by Harold Hotelling (1935)


Listen Later

Hotelling, Harold. "Analysis of a complex of statistical variables into principal components." Journal of educational psychology 24.6 (1933): 417.


This seminal work by Harold Hotelling on PCA remains highly relevant to modern data science because PCA is still widely used for dimensionality reduction, feature extraction, and data visualization.

The foundational concepts of eigenvalue decomposition and maximizing variance in orthogonal directions form the backbone of PCA, which is now automated through numerical methods such as Singular Value Decomposition (SVD).
Modern PCA handles much larger datasets with advanced variants (e.g., Kernel PCA, Sparse PCA), but the core ideas from the paper—identifying and interpreting key components to reduce dimensionality while preserving the most important information—are still crucial in handling high-dimensional data efficiently today.

...more
View all episodesView all episodes
Download on the App Store

Data Science DecodedBy Mike E

  • 3.8
  • 3.8
  • 3.8
  • 3.8
  • 3.8

3.8

5 ratings


More shows like Data Science Decoded

View all
Radiolab by WNYC Studios

Radiolab

43,974 Listeners

My Favorite Theorem by Kevin Knudson & Evelyn Lamb

My Favorite Theorem

100 Listeners

WW2 Pod: We Have Ways of Making You Talk by Goalhanger

WW2 Pod: We Have Ways of Making You Talk

1,446 Listeners

The Rest Is History by Goalhanger

The Rest Is History

15,865 Listeners