
Daniel Vila

dvilasuero.hf.co


Everything datasets and human feedback for AI at Hugging Face. Prev: co-founder and CEO of Argilla (acquired by Hugging Face)

Joined October 2024


Reposted by Daniel Vila
Florent Daudens fdaudens.bsky.social · Jan 28, 2025
[Not loaded yet]


Reposted by Daniel Vila
sdiazlor sdiazlor.hf.co · Jan 20, 2025
[Not loaded yet]


Reposted by Daniel Vila
Natalia nataliaelv.hf.co · Jan 17, 2025
New chapter in the Hugging Face NLP course! 🤗 🚀 We've added a new chapter about the very basics of Argilla to the Hugging Face NLP course. Learn how to set up an Argilla instance, load & annotate datasets, and export them to the Hub. Any feedback for improvements welcome!


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Jan 16, 2025
[This post could not be retrieved]


Reposted by Daniel Vila
David Berenstein davidberenstein.bsky.social · Jan 7, 2025
High-quality data for fine-tuning language models for free and at the click of a button! Prompt and wait for your dataset to push to Argilla or the Hub. Evaluate, review and fine-tune a model. Blog:
Fine-tune a SmolLM on domain-specific synthetic data from a LLM

A Blog post by David Berenstein on Hugging Face

buff.ly


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Jan 3, 2025
[This post could not be retrieved]


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Jan 6, 2025
[This post could not be retrieved]


Daniel Vila dvilasuero.hf.co · Dec 20, 2024
💥 Ending 2024: A full data annotation journey on the Hugging Face Hub, from raw data to training-ready datasets! With Argilla 2.6.0, push your data to the Hub from the UI. Let’s make 2025 the year anyone can build more transparent and accountable AI, no coding or model skills needed.

Daniel Vila dvilasuero.hf.co · Dec 20, 2024
Release notes: github.com/argilla-io/a...

Daniel Vila dvilasuero.hf.co · Dec 20, 2024
Get started: docs.argilla.io/latest/getti...
Quickstart - Argilla Docs

Get started with Argilla in less than 10 minutes

docs.argilla.io


Reposted by Daniel Vila
jfcalvo jfcalvo.hf.co · Dec 19, 2024
[Not loaded yet]


Reposted by Daniel Vila
David Berenstein davidberenstein.bsky.social · Dec 17, 2024
🔥 We got great feedback on this: "Synthetic Data Generator" A no-code tool to create datasets with LLMs, making it a breeze, allowing ANYONE to create datasets and models in minutes and without any code. Blog: buff.ly/4gybyoT GitHub: buff.ly/49IDSmd Space: buff.ly/3Y1S99z
Introducing the Synthetic Data Generator - Build Datasets with Natural Language

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

buff.ly


Reposted by Daniel Vila
ashvanths ashvanths.bsky.social · Dec 14, 2024
[Not loaded yet]


Reposted by Daniel Vila
johko johko.bsky.social · Dec 13, 2024
[Not loaded yet]


Reposted by Daniel Vila
moritzlaurer moritzlaurer.bsky.social · Dec 12, 2024
[Not loaded yet]


Reposted by Daniel Vila
Ben Burtenshaw benburtenshaw.bsky.social · Dec 12, 2024
Desperate to contribute to the development of Scots language AI. I've just contributed 16 examples to this dataset: data-is-better-together-fineweb-c.hf.space/share-your-p...
sco - Scots - Scots

Join and contribute to the dataset sco - Scots - Scots

data-is-better-together-fineweb-c.hf.space


Daniel Vila dvilasuero.hf.co · Dec 12, 2024
I've just contributed 156 examples to the FineWeb 2 Spanish dataset: data-is-better-together-fineweb-c.hf.space/share-your-p... If you want to contribute, sign in with @hf.co and find your language
spa - español - Spanish

Join and contribute to the dataset spa - español - Spanish

data-is-better-together-fineweb-c.hf.space


Daniel Vila dvilasuero.hf.co · Dec 10, 2024
Help shape the future of multilingual Open Source AI! Join the FineWeb 2 Community Annotation Sprint to create an open training dataset with full transparency and human validation in many languages. Review datasets in your language and help identify the best sources for training.

Daniel Vila dvilasuero.hf.co · Dec 10, 2024
Join this Space, search for your language, and start contributing: huggingface.co/spaces/data-... Don't know how to start, want to discuss? Join: huggingface.co/spaces/Huggi...
FineWeb-c - Annotation - a Hugging Face Space by data-is-better-together

Discover amazing ML apps made by the community

huggingface.co


Reposted by Daniel Vila
frascuchon frascuchon.bsky.social · Dec 3, 2024
[Not loaded yet]


Reposted by Daniel Vila
David Berenstein davidberenstein.bsky.social · Dec 9, 2024
👐 Open Image Preferences is an Apache 2.0 licensed dataset for text-to-image generation by the @hf.co community. This dataset contains 10K text-to-image preference pairs across image generation categories, using different model families and prompt complexities. Blog: huggingface.co/blog/image-p...
Open Preference Dataset for Text-to-Image Generation by the 🤗 Community


huggingface.co


Reposted by Daniel Vila
sdiazlor sdiazlor.hf.co · Dec 9, 2024
[Not loaded yet]


Daniel Vila dvilasuero.hf.co · Dec 6, 2024
Announcing Global-MMLU - an improved MMLU Open dataset with evaluation coverage across 42 languages. The result of months of work with the goal of advancing Multilingual LLM evaluation. Built together with the community and amazing collaborators at Cohere4AI, MILA, MIT, and many more.

Daniel Vila dvilasuero.hf.co · Dec 6, 2024
Open dataset: huggingface.co/datasets/Coh... Paper: arxiv.org/pdf/2412.03304
CohereForAI/Global-MMLU · Datasets at Hugging Face


huggingface.co


Daniel Vila dvilasuero.hf.co · Dec 3, 2024
We're about to launch the biggest collaboration effort since the Open Assistant. Let's get the highest quality data for open foundation models with all the nuances & diversity of each language, all with data provenance and transparency. Join us as language lead: docs.google.com/forms/d/10XI...
Language Lead sign-up

At Hugging Face 🤗, we're launching a big community initiative to improve LLM training for many languages. We're looking for Language Leads to help us cultivate specific languages during this initiativ...

docs.google.com


Reposted by Daniel Vila
Natalia nataliaelv.hf.co · Dec 3, 2024
Next week we're launching a collaborative annotation effort to build a big multilingual dataset, so you can have high-quality data in your language. We are really close to getting leads for 100 languages! Can you help us cover the remaining 200?


Reposted by Daniel Vila
Ben Burtenshaw benburtenshaw.bsky.social · Dec 3, 2024
For anyone interested in fine-tuning or aligning LLMs, I’m running this free and open course called smol course. It’s not a big deal, it’s just smol. 🧵>>


Reposted by Daniel Vila
jfcalvo jfcalvo.hf.co · Dec 2, 2024
[Not loaded yet]


Reposted by Daniel Vila
Ben Burtenshaw benburtenshaw.bsky.social · Nov 30, 2024
[SATURDAY THREAD] ☕️ 🧑‍🎓 In case you spent the week reading GDPR legislation and missed everything. It’s all about vision language models and image preference datasets. >> 🧵 Here are the models and datasets you can use in your projects.


Reposted by Daniel Vila
damianpumar damianpumar.hf.co · Nov 26, 2024
[Not loaded yet]


Reposted by Daniel Vila
damianpumar damianpumar.hf.co · Nov 29, 2024
[Not loaded yet]


Reposted by Daniel Vila
stellaathena stellaathena.bsky.social · Nov 28, 2024
[Not loaded yet]


Reposted by Daniel Vila
mmitchell mmitchell.bsky.social · Nov 27, 2024
[Not loaded yet]


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Nov 27, 2024
[This post could not be retrieved]


Reposted by Daniel Vila
Ben Burtenshaw benburtenshaw.bsky.social · Nov 26, 2024
The community has labelled over 3000 image preferences in a few hours. One open source image preferences dataset coming right up!
- Ben Burtenshaw benburtenshaw.bsky.social · Nov 26, 2024
  [Not loaded yet]

Reposted by Daniel Vila
Florent Daudens fdaudens.bsky.social · Nov 26, 2024
[Not loaded yet]


Reposted by Daniel Vila
Natalia nataliaelv.hf.co · Nov 26, 2024
At @huggingface.bsky.social 🤗 we're preparing a collaborative annotation effort to build an open-source multilingual dataset. If you'd like to get high-quality open data for your language, check if yours is listed in this form and sign up! forms.gle/DHJdtvoSNxAA...
Language Lead sign-up

At Hugging Face 🤗, we're launching a big community initiative to improve LLM training for many languages. We're looking for Language Leads to help us cultivate specific languages during this initiativ...

forms.gle


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Nov 26, 2024
[Not loaded yet]


Reposted by Daniel Vila
jfcalvo jfcalvo.hf.co · Nov 26, 2024
[Not loaded yet]


Reposted by Daniel Vila
ameeelie ameeelie.bsky.social · Nov 26, 2024
[Not loaded yet]


Daniel Vila dvilasuero.hf.co · Nov 26, 2024
Super excited to launch the Open Images Preferences @huggingface.bsky.social community sprint. Have fun browsing images generated with the latest OSS models while contributing to the future of Open Source AI 🧵

Daniel Vila dvilasuero.hf.co · Nov 26, 2024
Find all the details in the blog post, you just need to sign in and start choosing the images you prefer. huggingface.co/blog/burtens...
Let’s make a generation of amazing image generation models

A Blog post by ben burtenshaw on Hugging Face

huggingface.co


Reposted by Daniel Vila
markcollier markcollier.me · Nov 24, 2024
[Not loaded yet]


Reposted by Daniel Vila
ashvanths ashvanths.bsky.social · Nov 26, 2024
[Not loaded yet]


Daniel Vila dvilasuero.hf.co · Nov 26, 2024
Let's make AI more inclusive. At @huggingface.bsky.social we'll launch a huge community sprint soon to build high-quality training datasets for many languages. We're looking for Language Leads to help with outreach. Find your language and nominate yourself: forms.gle/iAJVauUQ3FN8...

Daniel Vila dvilasuero.hf.co · Nov 26, 2024
Contributing to the task itself will be easy as well, with no programming skills required, just reading short documents in the language and rating them according to their educational quality for training AI models


Reposted by Daniel Vila
witko witko.bsky.social · Nov 25, 2024
[Not loaded yet]


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Nov 25, 2024
[This post could not be retrieved]


Reposted by Daniel Vila
Anthony A. Gatti aagatti.bsky.social · Nov 25, 2024
[Not loaded yet]


Daniel Vila dvilasuero.hf.co · Nov 25, 2024
Interested in open datasets for ML and AI? I've just created this feed with posts about @huggingface.bsky.social datasets! Don't miss the latest news and conversations about the secret sauce behind every AI model. bsky.app/profile/dvil...
Feed: 🤗 Hugging Face Datasets


Daniel Vila dvilasuero.hf.co · Nov 25, 2024
This is super useful for NLP lovers like myself
- Maria Antoniak mariaa.bsky.social · Nov 25, 2024
  [This post could not be retrieved]

Reposted by Daniel Vila
Philipp Schmid philschmid.bsky.social · Nov 25, 2024
[Not loaded yet]


Reposted by Daniel Vila
Ben Burtenshaw benburtenshaw.bsky.social · Nov 25, 2024
TRL is a cornerstone of LLM post training and imo it's the default to learn. There are great alternatives like Unsloth, Axolotl, and AutoTrain. But if you want a daily driver that goes from experimentation to production, it's TRL. 🧵 these community notebooks guide you through TRL's core:


Daniel Vila dvilasuero.hf.co · Nov 24, 2024
I am very excited to launch a new community initiative next week. Let's build the largest open community dataset to evaluate and improve image generation models. Follow: huggingface.co/data-is-bett... And stay tuned here
data-is-better-together (Data Is Better Together)

Building better datasets together

huggingface.co


Reposted by Daniel Vila
Hugging Face hf.co · Nov 22, 2024
[Not loaded yet]


Reposted by Daniel Vila
Ben Burtenshaw benburtenshaw.bsky.social · Nov 23, 2024
In case you passed out and woke up on Saturday lunch. Small models and high quality data are back! ... if they ever left 🤔
- SmolTalk dataset from @huggingface.bsky.social
- Tulu 3 models and datasets from @ai2.bsky.social
- Nvidia Hymba model from @nvidiastudio.bsky.social


Reposted by Daniel Vila
elifennellphd elifennellphd.bsky.social · Nov 23, 2024
[Not loaded yet]


Daniel Vila dvilasuero.hf.co · Nov 22, 2024
📣 @huggingface.bsky.social important 🦋 updates: 🚀 You can now add your handle on your HF profile for others to find you on 🦋 huggingface.co/settings/pro... ❤️ We have just updated the list of Hugging Face Folks: bsky.app/starter-pack...


Reposted by Daniel Vila
Brigitte brigittetousi.hf.co · Nov 22, 2024
[This post could not be retrieved]


Reposted by Daniel Vila
Tom Aarsen tomaarsen.com · Nov 22, 2024
Don't forget to set your Bluesky account in your @huggingface.bsky.social profile! Instructions in 🧵


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Nov 22, 2024
[This post could not be retrieved]


Reposted by Daniel Vila
Daniel van Strien danielvanstrien.bsky.social · Nov 22, 2024
[This post could not be retrieved]


Reposted by Daniel Vila
Philipp Schmid philschmid.bsky.social · Nov 22, 2024
[Not loaded yet]


Reposted by Daniel Vila
catherinebreslin catherinebreslin.bsky.social · Nov 22, 2024
[Not loaded yet]


Reposted by Daniel Vila
Nathan Lambert natolambert.bsky.social · Nov 22, 2024
[This post could not be retrieved]


Reposted by Daniel Vila
osanseviero osanseviero.bsky.social · Nov 22, 2024
[Not loaded yet]


Reposted by Daniel Vila
ameeelie ameeelie.bsky.social · Nov 22, 2024
[Not loaded yet]

