Xenova
Bringing the power of machine learning to the web. Currently working on Transformers.js (@huggingface 🤗)
- As someone who learnt so much by watching @shiffman.lol's coding videos in high school, I never imagined that one day my own library would feature on his channel! 🥹 If you're interested in learning more about 🤗 Transformers.js, I highly recommend checking it out! 👉 www.youtube.com/watch?v=KR61...
- The next generation of AI-powered websites is going to be WILD! 🤯 In-browser tool calling & MCP are finally here, allowing LLMs to interact with websites programmatically. To show what's possible, I built a demo using Liquid AI's new LFM2 model, powered by 🤗 Transformers.js.
- As always, the demo is open source (which you can find under the "Files" tab), so I'm excited to see how the community builds upon this! 🚀 🔗 Link to demo: huggingface.co/spaces/Liqui...
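For reference, the core of a demo like this can be sketched in a few lines of Transformers.js. Everything model-specific below is an assumption (the ONNX repo id, the tool-call output format, and that the chat template accepts a `tools` option); the Space's source has the exact setup:

```js
import { AutoTokenizer, pipeline } from "@huggingface/transformers";

// Assumed ONNX export id; see the Space's "Files" tab for the real one.
const model_id = "onnx-community/LFM2-1.2B-ONNX";
const tokenizer = await AutoTokenizer.from_pretrained(model_id);
const generator = await pipeline("text-generation", model_id, { device: "webgpu" });

// Describe a page-level "tool" the model may call (JSON-schema style).
const tools = [{
  name: "set_background",
  description: "Change the page background colour",
  parameters: {
    type: "object",
    properties: { color: { type: "string", description: "Any CSS colour" } },
    required: ["color"],
  },
}];

// Render the chat template with the tool definitions inlined
// (recent Transformers.js versions forward extra fields like `tools` to the template).
const prompt = tokenizer.apply_chat_template(
  [{ role: "user", content: "Make this page dark mode, please." }],
  { tools, tokenize: false, add_generation_prompt: true },
);

const [out] = await generator(prompt, { max_new_tokens: 128, return_full_text: false });

// The reply contains a structured tool call (the exact format is model-specific);
// the page parses it and dispatches to the matching JavaScript function,
// e.g. document.body.style.background = args.color;
console.log(out.generated_text);
```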
- Introducing Voxtral WebGPU: State-of-the-art audio transcription directly in your browser! 🤯 🗣️ Transcribe videos, meetings, songs and more 🔐 Runs on-device, meaning no data is sent to a server 🌎 Multilingual (8 languages) 🤗 Completely free (forever) & open source
- That's right, we're running Mistral's new Voxtral-Mini-3B model 100% locally in-browser on WebGPU, powered by Transformers.js and ONNX Runtime Web! 🔥 Try it out yourself! 👇 huggingface.co/spaces/webml...
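A minimal sketch of what the transcription call could look like, assuming the export works with the standard automatic-speech-recognition pipeline (the repo id is a guess; the Space's source shows the actual setup):

```js
import { pipeline } from "@huggingface/transformers";

// Assumed ONNX export id; check the Space's files for the exact repository.
const transcriber = await pipeline(
  "automatic-speech-recognition",
  "onnx-community/Voxtral-Mini-3B-2507-ONNX",
  { device: "webgpu" },
);

// Accepts a URL/File or a Float32Array of 16 kHz mono samples.
const { text } = await transcriber("https://example.com/meeting.wav");
console.log(text);
```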
- A community member trained a tiny Llama model (23M parameters) on 3 million high-quality @lichess.org games, then deployed it to run entirely in-browser with 🤗 Transformers.js! Super cool! 🔥 It has an estimated Elo rating of ~1400... can you beat it? 👀 (runs on both mobile and desktop)
- Model: huggingface.co/lazy-guy12/c... Online demo: lazy-guy.github.io/chess-llama/
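A rough sketch of how you might query such a model from Transformers.js, assuming an ONNX export is available (the repo id is hypothetical since the link above is truncated, and the prompt format is a guess; the model card documents the real ones):

```js
import { pipeline } from "@huggingface/transformers";

// Hypothetical repository id, for illustration only.
const generator = await pipeline("text-generation", "lazy-guy12/chess-llama");

// Trained on game transcripts, so a partial move list prompts a continuation.
const prompt = "1. e4 e5 2. Nf3 Nc6 3. Bb5 ";
const [out] = await generator(prompt, { max_new_tokens: 4, return_full_text: false });
console.log(out.generated_text.trim()); // the model's predicted next move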
- We did it! Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️ Generate 10 seconds of speech in ~1 second for $0. What will you build? 🔥
- The most difficult part was getting the model running in the first place, but the next steps are simple: ✂️ Implement sentence splitting, enabling streamed responses 🌍 Multilingual support (only phonemization left) Who wants to help? 🤗 huggingface.co/spaces/webml...
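The sentence-splitting step is plain JavaScript; a minimal regex-based sketch of the idea (an approximation that ignores abbreviations etc.):

```js
// Split text into sentences so each one can be synthesized and played as soon
// as it is ready, instead of waiting for the whole passage.
function* splitSentences(text) {
  for (const match of text.matchAll(/[^.!?]+[.!?]+(?:\s+|$)/g)) {
    yield match[0].trim();
  }
}

for (const sentence of splitSentences("Hello there! How are you? I'm fine.")) {
  console.log(sentence); // -> "Hello there!" / "How are you?" / "I'm fine."
}
```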
- Introducing Kokoro.js, a new JavaScript library for running Kokoro TTS, an 82-million-parameter text-to-speech model, 100% locally in the browser w/ WASM. Powered by 🤗 Transformers.js. WebGPU support coming soon! 👉 npm i kokoro-js 👈 Link to demo (+ sample code) in 🧵
- You can get started in just a few lines of code! 🧑‍💻 Huge kudos to the Kokoro TTS community, especially taylorchu for the ONNX exports and Hexgrad for the amazing project! None of this would be possible without you all! 🤗 Try it out yourself: huggingface.co/spaces/webml...
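Based on the package README, getting started looks roughly like this (voice names and dtype options may differ across versions):

```js
import { KokoroTTS } from "kokoro-js";

const tts = await KokoroTTS.from_pretrained("onnx-community/Kokoro-82M-ONNX", {
  dtype: "q8", // quantized variant; see the next post on file sizes
});

const audio = await tts.generate("Life is like a box of chocolates.", {
  voice: "af_bella", // list available voices with tts.list_voices()
});
audio.save("audio.wav"); // in the browser, play or download the returned audio instead
```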
- The model is also extremely resilient to quantization. The smallest variant is only 86 MB in size (down from the original 326 MB), with no noticeable difference in audio quality! 🤯 Link to models/samples: huggingface.co/onnx-communi...
- Is this the future of AI browser agents? 👀 WebGPU-accelerated reasoning LLMs are now supported in Transformers.js! 🤯 Here's MiniThinky-v2 (1B) running 100% locally in the browser at ~60 tps (no API calls)! I can't wait to see what you build with it! Demo + source code in 🧵👇
- For the AI builders out there: imagine what could be achieved with a browser extension that (1) uses a powerful reasoning LLM, (2) runs 100% locally & privately, and (3) can directly access/manipulate the DOM! 👀 💻 Source code: github.com/huggingface/... 🔗 Online demo: huggingface.co/spaces/webml...
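Under the hood it's the usual Transformers.js text-generation pipeline; a sketch, where the model id and dtype are assumptions (the linked source has the exact values):

```js
import { pipeline, TextStreamer } from "@huggingface/transformers";

// Assumed ONNX export id, for illustration.
const generator = await pipeline(
  "text-generation",
  "onnx-community/MiniThinky-v2-1B-Llama-3.2-ONNX",
  { device: "webgpu", dtype: "q4f16" },
);

const messages = [{ role: "user", content: "What is 17 × 24? Think step by step." }];

// Stream tokens to the UI as they are generated.
const streamer = new TextStreamer(generator.tokenizer, { skip_prompt: true });
const output = await generator(messages, { max_new_tokens: 512, streamer });
console.log(output[0].generated_text.at(-1).content); // includes the reasoning trace
```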
- First project of 2025: Vision Transformer Explorer! I built a web app to interactively explore the self-attention maps produced by ViTs. This explains what the model is focusing on when making predictions, and provides insights into its inner workings! 🤯 Try it out yourself! 👇
- The app loads a small DINOv2 model into the user's browser and runs it locally using Transformers.js! 🤗 This means you can analyze your own images for free: simply click the image to open the file dialog. E.g., the model recognizes that long necks and fluffy ears are defining features of llamas! 🦙
- Vision Transformers work by dividing images into fixed-size patches (e.g., 14 × 14), flattening each patch into a vector and treating each as a token. It's fascinating to see what each attention head learns to "focus on". For example, layer 11, head 1 seems to identify eyes. Spooky! 👀
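To make the patch arithmetic concrete, assuming a 224 × 224 input (as in the smaller DINOv2 checkpoints):

```js
// A 224 × 224 image with 14 × 14 patches -> a 16 × 16 grid of patch tokens.
const imageSize = 224, patchSize = 14;
const perSide = imageSize / patchSize; // 16
const numPatches = perSide ** 2;       // 256
// Plus 1 [CLS] token (and 4 register tokens for DINOv2 w/ Registers),
// giving the sequence length the attention maps are computed over.
console.log(`${perSide} × ${perSide} grid -> ${numPatches} patch tokens`);
```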
- This project was greatly inspired by Brendan Bycroft's amazing LLM Visualization tool – check it out if you haven't already! Also, thanks to Niels Rogge for adding DINOv2 w/ Registers to transformers! 🤗 Source code: github.com/huggingface/... Online demo: huggingface.co/spaces/webml...
- Introducing Moonshine Web: real-time speech recognition running 100% locally in your browser! 🚀 Faster and more accurate than Whisper 🔒 Privacy-focused (no data leaves your device) ⚡️ WebGPU accelerated (w/ WASM fallback) 🔥 Powered by ONNX Runtime Web and Transformers.js Demo + source code below! 👇
- Huge shout-out to the Useful Sensors team for such an amazing model and to Wael Yasmina for his 3D audio visualizer tutorial! 🤗 💻 Source code: github.com/huggingface/... 🔗 Online demo: huggingface.co/spaces/webml...
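A minimal sketch of the transcription call (the repo id is assumed; the real-time microphone capture and resampling in the demo is where most of the work lives):

```js
import { pipeline } from "@huggingface/transformers";

// Assumed ONNX export id; the demo source lists the exact repository.
const transcriber = await pipeline(
  "automatic-speech-recognition",
  "onnx-community/moonshine-tiny-ONNX",
  { device: "webgpu" }, // omit to use the WASM backend
);

// For real-time use, feed 16 kHz mono Float32Array chunks from the microphone.
const { text } = await transcriber("https://example.com/sample.wav");
console.log(text);
```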
- Reposted by Xenova 🤗: NEW PIECE: ‘Open-source’ is becoming a buzzword for many aspects of modern journalism, including open-source AI. But what is it, and how can journalists benefit from it? @marinaadami.bsky.social spoke to @fdaudens.bsky.social to find out. reutersinstitute.politics.ox.ac.uk/news/journal...