Andi: 🚀 We just dropped SmolDocling: a 256M open-source vision LM for complete document OCR! 📄✨ Lightning fast, process a page in 0.35 sec on consumer GPU using < 500MB VRAM ⚡ SOTA in document conversion, beating every competing model we tested (including models 27x more params) 🤯 But how? 🧶⬇️

See full post

Andi andimara.bsky.social
🚀 We just dropped SmolDocling: a 256M open-source vision LM for complete document OCR! 📄✨ Lightning fast, process a page in 0.35 sec on consumer GPU using < 500MB VRAM ⚡ SOTA in document conversion, beating every competing model we tested (including models 27x more params) 🤯 But how? 🧶⬇️
Mar 17, 2025 15:53
0 reposts 0 quotes 0 likes

View on Bluesky Download image Show all post labels
Andi andimara.bsky.social · Mar 17, 2025
How does SmolDocling beat models 27× bigger? SmolDocling transforms any document into structured metadata with DocTags, being SOTA in: ✅ Full-page conversion ✅ Layout identification ✅ Equations, tables, charts, plots, code OCR

View on Bluesky Download image Show all post labels
Andi andimara.bsky.social · Mar 17, 2025
What makes it unique? 📌 Handles everything a document has: tables, charts, code, equations, lists, and more 📌 Works beyond scientific papers—supports business docs, patents, and forms 📌 It runs with less than 1GB of RAM, so running at large batch sizes is super cheap!

View on Bluesky Download image Show all post labels
Andi andimara.bsky.social · Mar 17, 2025
At only 256M parameters, SmolDocling outperforms much larger models on key document conversion tasks: 🖋️ Full-page transcription: Beats models 27× bigger! 📑 Equations: Matches or beats leading models like GOT 💻 Code recognition: We introduce the first benchmark for code OCR

View on Bluesky Show all post labels
Andi andimara.bsky.social · Mar 17, 2025
SmolDocling is available today 🏗️ 🔗 Model: huggingface.co/ds4sd/SmolDo... 📖 Paper: huggingface.co/papers/2503.... 🤗 Space: huggingface.co/spaces/ds4sd... Try it and let us know what you think! 💬
ds4sd/SmolDocling-256M-preview · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

View on Bluesky Show all post labels

An unhandled error has occurred. Reload 🗙

ds4sd/SmolDocling-256M-preview · Hugging Face