Natalia
Building Argilla @ Hugging Face 🤗. Linguist at heart. I sometimes write in Spanish.
- New chapter in the Hugging Face NLP course! 🤗 🚀 We've added a new chapter about the very basics of Argilla to the Hugging Face NLP course. Learn how to set up an Argilla instance, load & annotate datasets, and export them to the Hub. Any feedback for improvements welcome!
- Start learning here: huggingface.co/learn/nlp-co...
- Reposted by Natalia: 🚀 Argilla v2.6.0 is here! 🎉 Let me show you how EASY it is to export your annotated datasets from Argilla to the Hugging Face Hub. 🤩 Take a look at this quick demo 👇 💁‍♂️ More info about the release at github.com/argilla-io/a... #AI #MachineLearning #OpenSource #DataScience #HuggingFace #Argilla
- I'm taking a well-deserved break to celebrate Christmas 🎄 ☃️ but the FineWeb2 annotation sprint continues! You can still contribute some annotations or start leading a language!
- Links: - Reach out to us: buff.ly/4gC2f7p - Do some annotations: buff.ly/4gFuguL - Not sure how to annotate? See this video guide: buff.ly/4gJ9Xg9
- If you are still wondering how the FineWeb2 annotations are done, how to follow the guidelines or how Argilla works, this is your video! I go through a few samples of the FineWeb2 dataset and classify them based on their educational content. Check it out!
- The FineWeb2 collaborative annotation sprint is also a way of keeping many languages alive. I talk about it in this LinkedIn post: buff.ly/49DghmN
- I've just contributed 142 examples to this dataset: data-is-better-together-fineweb-c.hf.space/share-your-p...
- Next week we're launching a collaborative annotation effort to build a big multilingual dataset, so you can have high-quality data in your language. We are really close to getting leads for 100 languages! Can you help us cover the remaining 200?
- Check if we're still looking for leads in your language: nataliaelv-language-leads-dashboard.hf.space Sign up: forms.gle/opx2CZUEza1r...
- Reposted by Natalia: 🙌 I just wanted to share a few thoughts about the latest Argilla release, 2.5.0, as it's a pretty big one! Argilla now has full support for webhooks, which means you can do some pretty cool stuff, like model training on the fly as annotations are created. 🤯 #MachineLearning #NLP #DataLabeling
- This is what you get in Bluesky when your feeds are Linguistics and otters 🦦😍
- At @huggingface.bsky.social 🤗 we're preparing a collaborative annotation effort to build an open-source multilingual dataset. If you'd like to get high-quality open data for your language, check if yours is listed in this form and sign up! forms.gle/DHJdtvoSNxAA...
- Reposted by Natalia: I created a collection with good models for dataset curation: NSFW classifiers, PII classifiers, blazing fast embeddings by model2vec, a quality classifier, an educational value classifier, and a domain classifier. Collection: huggingface.co/collections/...
- Hello everyone! 👋 Since this is growing quite a bit, I thought I'd introduce myself: I'm Natalia, a computational linguist working at @huggingface.bsky.social as part of the team building Argilla.
- I like posting about super-high-quality data curation for AI, languages (modern and ancient!) and linguistics. If you'd like to follow my work on other platforms, you can find more links here: buff.ly/3OiuHPH
- Back to work after a week-long offsite in Martinique 🏝️ with my colleagues from @huggingface.bsky.social 🤗! I had time to relax, reflect, have fun and meet people who aren't just amazing at their work but also truly kind 💖 Can't wait for the next one!
- What's your strategy to save interesting posts and not forget about their existence?