See full post

Search

Posts Media People Feeds

Memming Park handle.invalid · Dec 7, 2025
Replying to Memming Park
Theoretical Insights on Training Instability in Deep Learning TUTORIAL uuujf.github.io/inst... gradient flow-like regime is slow and can overfit while large (but not too large) step size can trasiently go far, converge faster, and find better solutions #optimization #NeurIPS2025

View on Bluesky Show all post labels

Data Science @ Uni Vienna ds-vienna.bsky.social · Nov 29, 2024
How do #optimization algorithms choose solutions? Our hybrid #DataScience @univie.ac.at Talk on 2 Dec with Yurii Malitskyi uncovers the role of implicit bias—how #algorithms favour certain outcomes without explicit instructions—using real-world examples #DSHQ datascience.univie.ac.at/events/talks...

View on Bluesky Download image Show all post labels

An unhandled error has occurred. Reload 🗙