Anton
Feeding LLMs @ Hugging Face
- LLM Reasoning labs will be eating good today🍔 We commandeered the HF cluster for a few days and generated 1.2M reasoning-filled solutions to 500k NuminaMath problems with DeepSeek-R1 🐳 Have fun!
- 🤗 Dataset: huggingface.co/datasets/ope...
- Introducing 📐FineMath: the best open math pre-training dataset with 50B+ tokens! Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH. 🤗 huggingface.co/datasets/Hug... Here’s a breakdown 🧵
- First let’s break down how AI labs curate math pre-training datasets 🕵️ DeepSeekMath and QwenMath train a fastText classifier on data like OpenWebMath (OWM). They iteratively filter and recall math content from Common Crawl, focusing on the most relevant domains.
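- The filter-and-recall loop above can be sketched in a few lines. This is a toy illustration in pure Python: a keyword heuristic stands in for the trained fastText classifier, and the page/domain names are made up. One round keeps pages the classifier likes, then recalls everything else hosted on the most math-heavy domains.

```python
from collections import Counter
from urllib.parse import urlparse

# Toy stand-in for a fastText math classifier: fraction of math hint words.
MATH_HINTS = {"theorem", "integral", "equation", "proof", "algebra"}

def math_score(text: str) -> float:
    words = text.lower().split()
    if not words:
        return 0.0
    return sum(w.strip(".,") in MATH_HINTS for w in words) / len(words)

def filter_and_recall(pages, threshold=0.05, top_domains=2):
    """One round of the curation loop:
    1. keep pages the classifier scores as math,
    2. count which domains those pages come from,
    3. recall *all* pages from the most math-heavy domains."""
    kept = [(url, text) for url, text in pages if math_score(text) >= threshold]
    domain_counts = Counter(urlparse(url).netloc for url, _ in kept)
    best = {d for d, _ in domain_counts.most_common(top_domains)}
    recalled = [(url, text) for url, text in pages
                if urlparse(url).netloc in best]
    # union of direct classifier hits and domain-level recalls
    return list({url: (url, text) for url, text in kept + recalled}.values())

pages = [
    ("https://mathsite.org/a", "a proof of the theorem uses an integral"),
    ("https://mathsite.org/b", "course schedule and office hours"),
    ("https://news.example.com/c", "election results today"),
]
# page b has no math words but is recalled because mathsite.org scored well
curated = filter_and_recall(pages, threshold=0.05, top_domains=1)
```

In the real pipelines this runs iteratively over Common Crawl, with the recalled pages used to retrain the classifier each round.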
- Reposted by Anton: The Open LLM Leaderboard got a new front page for Christmas. Check it out at huggingface.co/spaces/open-...
- Reposted by Anton: Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and token throughput. SmolVLM can be fine-tuned in a Google Colab and run on a laptop! Or process millions of documents with a consumer GPU!
- Reposted by Anton: Small yet mighty! 💫 We are releasing SmolVLM: a new 2B small vision language model made for on-device use, fine-tunable on a consumer GPU, immensely memory efficient 🤠 We release three checkpoints under Apache 2.0: SmolVLM-Instruct, SmolVLM-Synthetic and SmolVLM-Base huggingface.co/collections/...
- Check out how easy it is to do LLM evals with LightEval! * any dataset on the 🤗 Hub can become an eval task in a few lines of code: customize the prompt, metrics, parsing, few-shots, everything! * model- and data-parallel inference * auto batching with the new vLLM backend
- Repo: github.com/huggingface/... Here's how we use it for SmolLM 🤏 github.com/huggingface/...
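- The core of "any Hub dataset becomes an eval task" is a small function mapping a dataset row to a prompt plus a gold answer. The sketch below shows that mapping for a GSM8K-style row; the function and field names are illustrative, not LightEval's actual API — see the repo above for the real task interface.

```python
def gsm8k_prompt(row: dict) -> dict:
    """Turn one 🤗 Hub dataset row into an eval doc: the prompt the model
    sees and the gold answer to score against. GSM8K answers end with
    '#### <number>', so we split on that marker to extract the target."""
    return {
        "query": f"Question: {row['question']}\nAnswer:",
        "gold": row["answer"].split("####")[-1].strip(),
    }

row = {"question": "2 + 2?", "answer": "We add two and two. #### 4"}
doc = gsm8k_prompt(row)
```

Everything else — metrics, few-shot formatting, parsing of model output — hangs off the same per-row hook.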