Random Matrices and Multivariate Statistical Analysis
Iain Johnstone, Statistics, Stanford [email protected]
SEA’06@MIT – p.1 Agenda • Classical multivariate techniques • Principal Component Analysis • Canonical Correlations • Multivariate Regression • Hypothesis Testing: Single and Double Wishart • Eigenvalue densities • Linear Statistics • Single Wishart • Double Wishart • Largest Eigenvalue • Single Wishart • Double Wishart • Concluding Remarks
SEA’06@MIT – p.2 Classical Multivariate Statistics Canonical methods are based on spectral decompositions:
One matrix (Wishart) • Principal Component analysis • Factor analysis • Multidimensional scaling Two matrices(independent Wisharts) • Multivariate Analysis of Variance (MANOVA) • Multivarate regression analysis • Discriminant analysis • Canonical correlation analysis • Tests of equality of covariance matrices
SEA’06@MIT – p.3 Gaussian data matrices . variables Sx ?nd S)SwS?7>DS) z = = cases Sx ?nz
Independent rows: xi ∼ Np(0, Σ),i=1,...n or: X ∼ N(0,In ⊗ Σp)
Zero mean ⇒ no centering in sample covariance matrix: n 1 T 1 S =(S ),S= X X, S = x x kk n kk n ik ik i=1 nS ∼ Wp(n, Σ)
SEA’06@MIT – p.4 Principal Components Analysis
Hotelling, 1933 X1,...,Xn ∼ Np(µ, Σ),
Low dim. subspace“explaining most variance”: