Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks Paper • 2310.02244 • Published Oct 3, 2023
On the Impact of the Activation Function on Deep Neural Networks Training Paper • 1902.06853 • Published Feb 19, 2019
Commutative Width and Depth Scaling in Deep Neural Networks Paper • 2310.01683 • Published Oct 2, 2023
Data pruning and neural scaling laws: fundamental limitations of score-based algorithms Paper • 2302.06960 • Published Feb 14, 2023
On the infinite-depth limit of finite-width neural networks Paper • 2210.00688 • Published Oct 3, 2022
From Optimization Dynamics to Generalization Bounds via Łojasiewicz Gradient Inequality Paper • 2202.10670 • Published Feb 22, 2022
Feature Learning and Signal Propagation in Deep Neural Networks Paper • 2110.11749 • Published Oct 22, 2021