Benjamin Lefaudeux 🇺🇦
Back to France after some time in sunny California and happy Copenhagen. Mistral, Photoroom, Meta (xformers, FairScale, R&D), EyeTribe (acq). Mostly writing around AI
- "The Serial Scaling Hypothesis" (arxiv.org/abs/2507.125..., Liu et al.) is interesting, I think. It's not as completely new as it looks (autoregressive models are used serially, models have depth, ...), but it feels like a good formalization and intuition of where current GPT-based LLMs will typically fail
- In the coming age of agents, I think vibe coding will die out, with the same lasting power as prompt engineering. For things LLMs excel at, you might as well stick to higher-level directives and let the model own the work; Claude Code is a good example. 1/2
- Still not a lot of ML talk on bsky (at least in my feed), hence paper Sunday: my two most interesting recent reads - H Nets arxiv.org/abs/2507.07955 - Energy Based Transformers arxiv.org/abs/2507.02092
- Little bit of personal news, shared in other circles already: I'm moving to Mistral in August, after three years at Photoroom. I'm really proud of what we built in the ML team with relatively limited means, lasting SOTA on the existing foundations (saliency segmentation) while growing a lot on genAI
- Alex Nichol is one of the rare many-hits researchers in the field, with, on top of that, a track record of practical models that ship and affect the public. That Meta wouldn't target him is pretty rich
- Automatically generating a fused megakernel in Triton... diving in, but if it works half as well as it reads, it would already be quite something. Aligns with torch.compile of course github.com/mirage-proje...