论文标题
深度学习理论的注释
Notes on Deep Learning Theory
论文作者
论文摘要
这些是我在2020年秋季在莫斯科物理技术研究所(MIPT)和Yandex数据分析学院(YSDA)上发表的讲座的注释。注释涵盖了初始化,损失景观,概括和神经切线内核理论的某些方面。尽管当前版本中缺少许多其他主题(例如表达性,平均场理论,双重下降现象),但我们计划将其添加到以后的修订中。
These are the notes for the lectures that I was giving during Fall 2020 at the Moscow Institute of Physics and Technology (MIPT) and at the Yandex School of Data Analysis (YSDA). The notes cover some aspects of initialization, loss landscape, generalization, and a neural tangent kernel theory. While many other topics (e.g. expressivity, a mean-field theory, a double descent phenomenon) are missing in the current version, we plan to add them in future revisions.