STAT547U: Topics in Deep Learning Theory

Jan-Feb 2025

Final Paper Presentation Assignment

You will choose one of the following papers to read in-depth throughout the course. At the end of the semster, you will have a 30 minute oral examination on your understanding of the paper. (I will post a list of specific questions to prepare later in the semester.) See the syllabus for a detailed description of the assignment.

If you have a paper in mind that is not on this list, chat with me!

TitleTopicsAuthorsVenueLink
Benign Overfitting in Linear Regressionbenign overfittingPeter L. Bartlett, Philip M. Long, Gábor Lugosi, Alexander TsiglerProceedings of the National Academy of Sciences, 2020PDF
Gradient Descent Maximizes the Margin of Homogeneous Neural NetworksclassificationKaifeng Lyu, Jian LiICLR, 2020PDF
Implicit Regularization of Random Feature Modelseffective regularizationArthur Jacot, Berfin Şimşek, Francesco Spadaro, Clément Hongler, Franck GabrielICML, 2020PDF
Kernel and Rich Regimes in Overparametrized Modelsfeature learningBlake Woodworth, Suriya Gunasekar, Jason D. Lee, Edward Moroshko, Pedro Savarese, Itay Golan, Daniel Soudry, Nathan SrebroCoLT, 2020PDF
On Lazy Training in Differentiable Programminginfinite widthLenaic Chizat, Edouard Oyallon, Francis BachNeurIPS, 2019PDF
Gaussian Process Behaviour in Wide Deep Neural Networksinfinite widthAlexander G. de G. Matthews, Mark Rowland, Jiri Hron, Richard E. Turner, Zoubin GhahramaniICLR, 2018PDF
Explaining Neural Scaling Lawsscaling lawsYasaman Bahri, Ethan Dyer, Jared Kaplan, Jaehoon Lee, Utkarsh SharmaProceedings of the National Academy of Sciences, 2024PDF