Orion Weller
- Ever wonder how test-time compute would do in retrieval? 🤔 introducing ✨rank1✨ rank1 is distilled from R1 & designed for reranking. rank1 is state-of-the-art at complex reranking tasks in reasoning, instruction-following, and general semantics (often 2x RankLlama 🤯) 🧵
- rank1 generates a reasoning chain before the final answer (usually ~250 tokens). Try a live demo: huggingface.co/spaces/orion... Our data (600k) and models are open source, check them out: huggingface.co/collections/... 📝: arxiv.org/abs/2502.18418 Keep reading to see what surprised us 😮
- Despite training on English only data and from base LMs (no instruct-tuning) rank1 models excel at instruction following AND are inherently promptable. They even are SOTA in multilingual instruction following, despite using no multilingual IR data 🤯
-
View full threadThanks as always to my advisors/coauthors at @jhuclsp.bsky.social including @vandurme.bsky.social Dawn Lawrie, Eugene Yang, Kathryn Ricci, and Andrew Yates!
- Reposted by Orion WellerWe use this collection of tasks to propose multiple benchmarks for multilingual, code, European and Indic languages, and many more. We find that smaller multilingual models (~500M) outperform notably larger 7B models, likely due to a limited multilingual pre-training.
- Check out our new encoder model, ModernBERT! 🤖 Super grateful to have been part of such an awesome team effort and very excited about the gains for retrieval/RAG! 🚀
- Still lots of areas to improve (multilingual data anyone 👀) but really happy with how successful this was! I've even been looking at how it works for instruction-based retrieval and turns out that having modern data helps a lot 🔥 Excited to see what you do with it!
- MASC is such a fun time! If your university is in the mid-Atlantic, please consider hosting!
- 📢 Want to host MASC 2025? The 12th Mid-Atlantic Student Colloquium is a one day event bringing together students, faculty and researchers from universities and industry in the Mid-Atlantic. Please submit this very short form if you are interested in hosting! Deadline January 6th. #MASC2025
- Reposted by Orion WellerI'm looking for an intern to introduce Sparse Embedding models to Sentence Transformers! If you're passionate about open source, interested in helping practitioners use your tools, and enjoy embedders/retrievers/rerankers, then I'd love to hear from you! Links with details and to apply in 🧵
- Reposted by Orion WellerI noticed a lot of starter packs skewed towards faculty/industry, so I made one of just NLP & ML students: go.bsky.app/vju2ux Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!at://did:plc:stalio2ctcgm6vmhqqksv3yr/app.bsky.graph.starterpack/3lbn6teyljo2e