- What if OCR models could show you their thought process? NuMarkdown-8B-Thinking from NuMind (YC S22) doesn't just extract text - it reasons through documents first. Could be pretty valuable for weird historical documents? Example here: davanstrien-ocr-time-capsule.static.hf.space/index.html?d...
- Model here: huggingface.co/numind/NuMar...
Aug 7, 2025 15:16
- Try it with one line of code via Jobs! It processes images from any dataset and outputs a new dataset with extracted markdown - all using HF GPUs. See the full OCR uv scripts collection: huggingface.co/datasets/uv-...