- Theoretical Insights on Training Instability in Deep Learning TUTORIAL uuujf.github.io/inst... The gradient flow-like regime is slow and can overfit, while a large (but not too large) step size can transiently go far, converge faster, and find better solutions #optimization #NeurIPS2025
- How do #optimization algorithms choose solutions? Our hybrid #DataScience @univie.ac.at Talk on 2 Dec with Yurii Malitskyi uncovers the role of implicit bias—how #algorithms favour certain outcomes without explicit instructions—using real-world examples #DSHQ datascience.univie.ac.at/events/talks...