漫蛙漫画

Wei Lin @ PKU

Math 867: High-Dimensional Data Analysis and Statistical Inference

Course Description

This is a graduate and advanced undergraduate level course in high-dimensional statistics, introducing the fundamental principles for the statistical modeling, analysis, and inference of high-dimensional and big data. High-dimensional regression, large covariance estimation, and large-scale hypothesis testing will be covered, along with necessary mathematical tools such as concentration inequalities and random matrix theory, as well as optimization algorithms such as coordinate descent and the alternating direction method of multipliers. Large data sets of various types will be presented and analyzed.

Syllabus

Announcements

  • Class on May 6 moved to Saturday, May 9, 3:10–5:50 pm, 1114 Science Building 1. If possible, please attend the by Professor Alan Gelfand.

Lecture Schedule

Week Date Topic References
1 March 4 Introduction ,
2 March 11 Concentration inequalities
Sparse linear regression
Boucheron, Lugosi & Massart Chapters 1 & 2
,
3 March 18 Sparse linear regression
Sparse GLMs
,
4 March 25 Group Lasso
Structured sparsity
,
5 April 1 Large-scale optimization
Discussion: Microbiome data analysis
,
6 April 8 Variable screening
Bayesian Lasso
Discussion: Chromatographic fingerprints


7 April 15 Sparse covariance estimation , ,
8 April 22 Sparse inverse covariance estimation , ,
9 April 29 Consistency of PCA
Sparse PCA
Discussion: Text analysis

, ,
10 May 9 Matrix perturbation theory
Random matrix theory
,
11 May 13 Low-rank matrix recovery
Discussion: Star formation in galaxies
,
12 May 20 False discovery rate control Efron Chapter 4,
13 May 27 Two-sample mean tests
Two-sample covariance tests
Discussion: Topological inference in neuroimaging
,
,
14 June 3 Scaled Lasso
Confidence intervals and tests
Discussion: Change detection in remote sensing
,
, ,
15 June 10 Office hours (schedule)
16 June 17 Presentations (schedule)

Homework and Projects