See full post

Olivier Grisel

ogrisel.bsky.social

Followers · Following

Software engineer at probabl, scikit-learn contributor. Also at: sigmoid.social/@ogrisel github.com/ogrisel

Joined February 2024

Posts Replies Media Original posts Likes

Reposted by Olivier Grisel
Tim Head betatim.bsky.social · Dec 11, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
jtp jtp.io · Nov 24, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Sung Kim sungkim.bsky.social · Nov 13, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Trevon Logan handle.invalid · Oct 27, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Skrub handle.invalid · Sep 26, 2025
⚡ Release 0.6.2 is out ⚡ github.com/skrub-data/s...
Release 0.6.2 · skrub-data/skrub

New features The DataOp.skb.full_report() now displays the time each node took to evaluate. #1596 by Jérôme Dockès. The User guide has been reworked and expanded. Changes and deprecations Ken em...

github.com

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Sep 26, 2025
I will speak about probabilistic regressions, @skrub-data.bsky.social and skore contributors will also present their libraries. Come join us!
- scikit-learn scikit-learn.org · Sep 26, 2025
  A bunch of scikit-learn core contributors will attend or speak at @pydataparis.bsky.social 2025 on Tuesday and Wednesday next week. Ticketing, practical infos and schedule at: pydata.org/paris2025
  PyData Paris 2025
  
  pydata.org
View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Sep 2, 2025
scikit-learn 1.8 will be the first scikit-learn release with native extensions that are officially marked as free-threading compatible. github.com/scikit-learn...
MNT Mark cython extensions as free-threaded compatible by lesteve · Pull Request #31342 · scikit-learn/scikit-learn

Part of #30007 Cython 3.1 has been released on May 8 2025. Following scipy PR scipy/scipy#22658 to use -Xfreethreading_compatible=True cython argument if cython >= 3.1 This cleans up the lock-fi...

github.com

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Aug 28, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Aug 28, 2025
Looking forward to attending PyData Paris 2025! I will give a talk about probabilistic predictions for regression problems (I need to start working on my slides ;)
- PyData Paris handle.invalid · Aug 28, 2025
  [Not loaded yet]
View on Bluesky Show all post labels

Reposted by Olivier Grisel
jtp jtp.io · Aug 19, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Aug 19, 2025
Today at #EuroScipy2025, @glemaitre58.bsky.social and I presented a tutorial on pitfalls of machine learning for imbalanced classification problems. We discussed what (not) to do when fitting a classifier and obtaining degenerate precision or recall values. probabl-ai.github.io/calibration-...
Imbalanced classification: pitfalls and solutions — Probabilistic calibration of cost-sensitive learning

probabl-ai.github.io

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Aug 18, 2025
Attending the @skrub-data.bsky.social tutorial by @riccardocappuzzo.com and @glemaitre58.bsky.social at #EuroScipy2025. They introduce the new DataOps feature released in skrub 0.6. Here is the repo with the material for the tutorial: github.com/skrub-data/E...

View on Bluesky Download image Show all post labels

Reposted by Olivier Grisel
Lennart Purucker lennartpurucker.bsky.social · Jun 23, 2025
🚨What is SOTA on tabular data, really? We are excited to announce 𝗧𝗮𝗯𝗔𝗿𝗲𝗻𝗮, a living benchmark for machine learning on IID tabular data with: 📊 an online leaderboard (submit!) 📑 carefully curated datasets 📈 strong tree-based, deep learning, and foundation models 🧵

View on Bluesky Download image Show all post labels

Reposted by Olivier Grisel
Gaël Varoquaux handle.invalid · Jul 9, 2025
👨‍🎓🧾✨#icml2025 Paper: TabICL, A Tabular Foundation Model for In-Context Learning on Large Data With Jingang Qu, @dholzmueller.bsky.social, and Marine Le Morvan TL;DR: a well-designed architecture and pretraining gives best tabular learner, and more scalable On top, it's 100% open source 1/9

View on Bluesky Download image (1)Download image (2)Show all post labels

Reposted by Olivier Grisel
David Holzmüller handle.invalid · Jul 24, 2025
Excited to have co-contributed the SquashingScaler, which implements the robust numerical preprocessing from RealMLP!
- Skrub handle.invalid · Jul 24, 2025
  ⚡ Release 0.6.0 is now out! ⚡ 🚀 Major update! Skrub DataOps, various improvements for the TableReport, new tools for applying transformers to the columns, and a new robust transformer for numerical features are only some of the features included in this release.
View on Bluesky Show all post labels

Reposted by Olivier Grisel
David Holzmüller handle.invalid · Jul 29, 2025
I got 3rd out of 691 in a tabular kaggle competition – with only neural networks! 🥉 My solution is short (48 LOC) and relatively general-purpose – I used skrub to preprocess string and date columns, and pytabkit to create an ensemble of RealMLP and TabM models. Link below👇

View on Bluesky Download image Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Jul 15, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Mathurin Massias mathurinmassias.bsky.social · Jun 18, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Jun 8, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Jun 5, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · May 21, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Guillaume Dalle gdalle.bsky.social · May 16, 2025
The week of September 29th, Paris will become the epicenter of #opensource scientific computing, with a great series of events. This rare alignment creates the perfect opportunity to visit and join a vibrant community of developers, maintainers, and users! Check this out (links in thread) ⬇️

View on Bluesky Download image Show all post labels

Olivier Grisel ogrisel.bsky.social · May 16, 2025
labs.quansight.org/blog/free-th...
The first year of free-threaded Python

A recap of the first year of work on enabling support for the free-threaded build of CPython in community packages.

labs.quansight.org

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Brett Cannon snarky.ca · May 14, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Donghee Na handle.invalid · May 10, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Michael "Shapes Dude" Betancourt handle.invalid · Apr 30, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Adam Johnson adamj.eu · Apr 20, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Apr 14, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
jorisvandenbossche jorisvandenbossche.bsky.social · Apr 3, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Skrub handle.invalid · Apr 3, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Mar 27, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Steve Klabnik steveklabnik.com · Mar 27, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Hugo van Kemenade hugovk.dev · Mar 25, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
David Picard davidpicard.bsky.social · Mar 21, 2025
🔥🔥🔥 CV Folks, I have some news! We're organizing a 1-day meeting in center Paris on June 6th before CVPR called CVPR@Paris (similar as NeurIPS@Paris) 🥐🍾🥖🍷 Registration is open (it's free) with priority given to authors of accepted papers: cvprinparis.github.io/CVPR2025InPa... Big 🧵👇 with details!

View on Bluesky Download image (1)Download image (2)Show all post labels

Reposted by Olivier Grisel
Gaël Varoquaux handle.invalid · Mar 18, 2025
🎓Paper time!✨ #ICLR spotlight. Concluding of 5 years of research on missing values handling for prediction: Beware of diminishing returns in imputation for prediction. 1/8

View on Bluesky Download image Show all post labels

Reposted by Olivier Grisel
Kyle Lo handle.invalid · Mar 13, 2025
we released olmo 32b today! ☺️ 🐟our largest & best fully open model to-date 🐠right up there w similar size weights-only models from big companies on popular benchmarks 🐡but we used way less compute & all our data, ckpts, code, recipe are free & open made a nice plot of our post-trained results!✌️
- Ai2 handle.invalid · Mar 13, 2025
  Announcing OLMo 2 32B: the first fully open model to beat GPT 3.5 & GPT-4o mini on a suite of popular, multi-skill benchmarks. Comparable to best open-weight models, but a fraction of training compute. When you have a good recipe, ✨ magical things happen when you scale it up!
View on Bluesky Download image Show all post labels

Reposted by Olivier Grisel
Simon Willison simonwillison.net · Mar 13, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Mar 14, 2025
Loky 3.5.0 is out! Loky provides an extended version of Python's `concurrent.futures.ProcessPoolExecutor` that leverages cloudpickle to work within interactive Jupyter sessions on all platforms and reuse existing workers to hide the overhead of starting new workers each time.

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Mar 11, 2025
I have the intuition that TabPFN approximates amortized Bayesian inference with a Solomonoff prior via in-context learning. Perplexity agrees :) www.perplexity.ai/search/is-it... I wonder if this theoretical "universality" is one of the reasons for its empirical success.
Is it valid to summarize TabPFN as doing amortized Bayesian inference with a...

Yes, it is valid to summarize TabPFN as performing amortized Bayesian inference with a Solomonoff prior via in-context learning. Here’s why: 1. [Amortized...

perplexity.ai

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Gaël Varoquaux handle.invalid · Feb 7, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Olivier Grisel ogrisel.bsky.social · Mar 7, 2025
Recently merged in scikit-learn's main branch: display the maximum predicted class probability in 2D continuous feature spaces (mostly for didactic purposes): scikit-learn.org/dev/auto_exa... The linked example has been updated to include some conclusions we can draw from this plot.

View on Bluesky Download image Show all post labels
Olivier Grisel ogrisel.bsky.social · Mar 8, 2025
Credits go to @lucyleeow.bsky.social who is now also on Bluesky!

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Chris Holdgraf choldgraf.com · Mar 4, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Mar 3, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Sander Dieleman handle.invalid · Jan 22, 2025
📢PSA: #NeurIPS2024 recordings are now publicly available! The workshops always have tons of interesting things on at once, so the FOMO is real😵‍💫 Luckily it's all recorded, so I've been catching up on what I missed. Thread below with some personal highlights🧵

View on Bluesky Show all post labels

Reposted by Olivier Grisel
PyData Paris handle.invalid · Feb 19, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Simon Willison simonwillison.net · Feb 14, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Florent Daudens fdaudens.bsky.social · Feb 10, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
Eugene Vinitsky 🍒 eugenevinitsky.bsky.social · Feb 6, 2025
We've built a simulated driving agent that we trained on 1.6 billion km of driving with no human data. It is SOTA on every planning benchmark we tried. In self-play, it goes 20 years between collisions.

View on Bluesky Download image Show all post labels

Reposted by Olivier Grisel
Sung Kim sungkim.bsky.social · Feb 7, 2025
[Not loaded yet]

View on Bluesky Show all post labels

Reposted by Olivier Grisel
François Fleuret francois.fleuret.org · Feb 6, 2025
It is hard to overstate how cool and powerful is flex attention. @chhillee.bsky.social pytorch.org/blog/flexatten… TL;DR: it is an implementation of the attention operator in pytorch that allows in particular to efficiently "carve" the attention matrix. 1/3
https://pytorch.org/blog/flexatten…

pytorch.org

View on Bluesky Show all post labels

An unhandled error has occurred. Reload 🗙