Xuan Son Nguyen
Software Engineer @ Hugging Face 🤗
- Very nice touch, Gmail 😅
- Part 2 of my journey building a smart home! 🚀 In this part: > ESPHome & custom component > RF433 receiver & transmitter > Hassio custom addon
- Link to article: blog.ngxson.com/building-my-...
- Just published a new article on my blog 🏃‍♂️ "Building My Smart Home - Part 1: Plan, Idea & Home Assistant". Check it out!
- Link to article: blog.ngxson.com/building-my-...
- Kudos to Google and the llama.cpp team! 🤝 GGUF support for Gemma 3 270M right from day-0
- Link here: huggingface.co/collections/...
- Reachy Mini and SmolLM3 are featured in GitHub's weekly news! 🚀 🚀
- Watch it here: www.youtube.com/watch?v=Qtzz...
- Gemma 3n has arrived in llama.cpp 👨🍳 🍰 Comes in 2 flavors: E2B and E4B (E means "effective/active parameters")
- See you this Sunday at AI Plumbers conference: 2nd edition! 📍 Where: GLS Event Campus Berlin, Kastanienallee 82 | 10435 Berlin 👉 Register here: lu.ma/vqx423ct
- ✨✨ AIFoundry is bringing you the AI Plumbers Conference: 2nd edition — an open source meetup for low-level AI builders to dive deep into "the plumbing" of modern AI 📍 Where: GLS Event Campus Berlin, Kastanienallee 82 | 10435 Berlin 📅 When: June 15, 2025 👉 Register now: lu.ma/vqx423ct
- Hugging Face Inference Endpoints now officially support deploying **vision** models via llama.cpp 👀 👀 Try it now: endpoints.huggingface.co/catalog
- Real-time webcam demo with @huggingface.bsky.social SmolVLM and llama.cpp server. All running locally on a MacBook M3
- Check it out: github.com/ngxson/smolv...
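Under the hood, the loop is simple: grab a webcam frame, base64-encode it, and POST it to the llama.cpp server's OpenAI-compatible endpoint. A minimal sketch of that loop, assuming llama-server is running locally with a SmolVLM GGUF and its --mmproj file (my own simplification, not the repo's exact code):

```typescript
// Sketch of the demo's core loop (assumption: llama-server is running on
// localhost:8080 with a SmolVLM model + --mmproj, so its OpenAI-compatible
// /v1/chat/completions endpoint accepts images).
async function describeFrame(video: HTMLVideoElement): Promise<string> {
  // Draw the current webcam frame onto a canvas, encode as a base64 data URL
  const canvas = document.createElement("canvas");
  canvas.width = video.videoWidth;
  canvas.height = video.videoHeight;
  canvas.getContext("2d")!.drawImage(video, 0, 0);
  const frame = canvas.toDataURL("image/jpeg");

  const res = await fetch("http://localhost:8080/v1/chat/completions", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      max_tokens: 100,
      messages: [{
        role: "user",
        content: [
          { type: "text", text: "Describe what you see in one sentence." },
          { type: "image_url", image_url: { url: frame } },
        ],
      }],
    }),
  });
  const json = await res.json();
  return json.choices[0].message.content;
}
```

Call describeFrame on a timer and you get a simple real-time captioner, with latency bounded mostly by the model's prefill speed.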
- We may have the A100, H200, M3 Ultra, etc., but they still can't match the power of that Casio FX 😆
- llama.cpp vision support just got much better! 🚀 Traditionally, models with complicated chat templates like MiniCPM-V or Gemma 3 required a dedicated binary to run. Now you can use all supported models via a single binary, "llama-mtmd-cli" 🔥 (Only Qwen2VL is not yet supported)
- Finally have time to write a blog post about ggml-easy! 😂 ggml-easy is a header-only wrapper for GGML that simplifies development with a cleaner API, easy debugging utilities, and native safetensors loading ✨ Great for rapid prototyping!
- Learn more: blog.ngxson.com/introducing-...
- Someone at Google definitely had a lot of fun making this 😆 And if you didn't know, it's available in the "Starter apps" section on AI Studio. The app is called "Gemini 95"
- Estimating an LLM's memory requirement WITHOUT a calculator? Just use your good old human brain 🧠 😎 Check out my 3-step estimation 🚀
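The full 3-step method is in the post itself, but for flavor, here's one common back-of-envelope approach (my own sketch with assumed constants, not necessarily the article's exact steps): weights plus KV cache, plus a bit of overhead.

```typescript
// Back-of-envelope memory estimate: weights + KV cache (a rough sketch;
// the constants are assumptions, not the article's exact method).
function estimateMemoryGB(opts: {
  paramsB: number;       // parameters, in billions
  bitsPerWeight: number; // ~16 for FP16, ~4.5 for Q4_K_M
  nLayers: number;       // transformer layers
  nKvHeads: number;      // KV heads (fewer than attention heads with GQA)
  headDim: number;       // dimension per head
  ctxLen: number;        // context length you plan to use
}): number {
  const { paramsB, bitsPerWeight, nLayers, nKvHeads, headDim, ctxLen } = opts;
  const weightBytes = (paramsB * 1e9 * bitsPerWeight) / 8;
  // K and V, one pair per layer, FP16 (2 bytes) per element:
  const kvBytes = 2 * nLayers * nKvHeads * headDim * ctxLen * 2;
  const overhead = 1.1; // ~10% for compute buffers etc. (rough guess)
  return ((weightBytes + kvBytes) * overhead) / 1e9;
}

// e.g. a typical 7B model at Q4 with 4k context:
// estimateMemoryGB({ paramsB: 7, bitsPerWeight: 4.5, nLayers: 32,
//                    nKvHeads: 8, headDim: 128, ctxLen: 4096 }) ≈ 5 GB
```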
- Google has quite a good sense of humor 😂 Jokes aside, a 1B model quantized to Q4 without performance degradation is sweet 🤏
- Cooking a fun thing today: I can now load safetensors files directly into GGML without having to convert them to GGUF! Why? Because this allows me to experiment faster, especially with models outside of llama.cpp 😆
- Where to try? ggml-easy --> github.com/ngxson/ggml-...
- No vibe coding. Just code it ✅ Visit my website --> ngxson.com
- On Monday, the 24th, I'm proud to be giving a talk at sota's webinar. The main talk will last an hour, deep-diving into the current state of on-device LLMs and exploring their advantages, trade-offs, and limitations. The session will end with a Q&A, where you can ask me anything about the subject.
- 📅 The Live Webinar will happen at 🕔 11 AM SF — 2 PM NYC — 6 PM London — 19h00 Paris 👉👉👉 Register here: app.getcontrast.io/register/sot... 👈👈👈
- Had a fantastic chat today with Georgi Gerganov, the brilliant mind behind ggml, llama.cpp, and whisper.cpp! We discussed: 🚀 The integration of vision models into llama.cpp 🚀 The challenges of maintaining a smooth UX/DX 🚀 The exciting future of llama.cpp Big things ahead - stay tuned!
- OK now you are the best, Gememe 2.0
- Wanna try Gemma 3 vision with llama.cpp? There is a playground for that! More in 🧵
- Follow the guide here: github.com/ggml-org/lla...
- Day-zero Gemma 3 support in llama.cpp 🤯 👉 4 model sizes: 1B, 4B, 12B, 27B 👉 Vision capability (except for 1B) with bidirectional attention 👉 Context size: 32k (1B) and 128k (4B, 12B, 27B) 👉 140+ languages supported (except for 1B) 👉 Day-zero support on many frameworks 🚀
- Huge thanks to Hugging Face and Google for supporting me with the llama.cpp implementation ❤️ More info: huggingface.co/blog/gemma3
- Aya Vision is now the number one trending OCR model on Hugging Face 🚀 👉 Comes in 2 sizes, 8B and 32B 👉 Supports 23 languages 👉 Day-zero support with HF Transformers
- Try via this space: huggingface.co/spaces/prith...
- Did you know? A number of 🤗 Hugging Face blog posts now feature AI-generated podcasts 🎙️ A nice alternative way to digest long, in-depth articles 🔍
- Qwen/QwQ-32B has just arrived on Hugging Chat! Try it now: huggingface.co/chat/models/...
- CogView-4 is out 🔥🚀 The SoTA OPEN text-to-image model by ZhipuAI Demo: huggingface.co/spaces/THUDM... ✨ 6B params with Apache 2.0 license ✨ Supports Chinese & English prompts of ANY length ✨ Generates Chinese characters within images ✨ Creates images at any resolution within a given range
- Wondering how much RAM is needed to run a given GGUF? Try: npx @huggingface/gguf [model].gguf This also works with remote files, for example: npx @huggingface/gguf https://huggingface.co/bartowski/Qwen_QwQ-32B-GGUF/resolve/main/Qwen_QwQ-32B-Q4_K_M.gguf
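The same package also works as a library, so you can script the check yourself. A small sketch (the gguf() parser is from @huggingface/gguf; the parameter-count math is my own addition, not part of the package's output):

```typescript
import { gguf } from "@huggingface/gguf"; // npm i @huggingface/gguf

// Parses only the GGUF header via HTTP range requests: no full download.
const url =
  "https://huggingface.co/bartowski/Qwen_QwQ-32B-GGUF/resolve/main/Qwen_QwQ-32B-Q4_K_M.gguf";
const { metadata, tensorInfos } = await gguf(url);

console.log(metadata["general.architecture"]); // e.g. "qwen2"

// Rough parameter count from the tensor shapes (shapes come back as bigints):
const totalParams = tensorInfos.reduce(
  (sum, t) => sum + t.shape.reduce((acc, dim) => acc * Number(dim), 1),
  0,
);
console.log(`~${(totalParams / 1e9).toFixed(1)}B parameters`);
```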
- Apple unveils the M3 Ultra chip, supporting up to 512GB of unified money, oops sorry, unified memory. Perfect for pro workflows and AI development 👀 Read more: www.apple.com/newsroom/202...
- Baby wake up! The Hugging Face Reasoning Course is out 🚀 Huge thanks to Maxime Labonne for building the first practical example in the reasoning course. Link: huggingface.co/reasoning-co...
- DiffRhythm's Revolutionary Music Generation 🎵🎵 🚀 Lightning-Fast Production: Create full-length songs with vocals in under 10 seconds! ⚡ Non-Autoregressive Structure: built on top of a variational autoencoder (VAE) 🤏 Small: VAE + base model combined are < 2.5GB 🌍 Open-source model code + weights
- With the new 🐸 JFrog model scanner on the 🤗 Hugging Face Hub, we're making running AI models even more secure for everyone!
- EgoLife: An AI-Powered Egocentric Life Assistant Key details: > Open-source dataset: 300+ hrs of egocentric, multimodal data > 3K long-context QAs for daily insights > Open-source models: EgoGPT & EgoRAG for smart recall Turning real-life moments into personalized AI help!
- More details: huggingface.co/collections/...
- Disassembling Phi-4-multimodal-instruct: > Vision/Audio encoders: 440M/460M respectively > Projector: 2-layer MLP for both modalities > Language model: Phi-4-mini, 3.3B parameters > LoRA adapters for Vision/Audio, applied on top of Phi-4-mini
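To make the wiring concrete, here's a schematic sketch of how those pieces compose (all names and types are illustrative stand-ins of mine, not Microsoft's actual code):

```typescript
// Schematic of the Phi-4-multimodal composition described above.
// Every type and function here is an illustrative stand-in, not a real API.
type Tensor = Float32Array;

interface Encoder { encode(input: Uint8Array): Tensor }    // vision or audio encoder
interface Projector { project(features: Tensor): Tensor }  // 2-layer MLP per modality
interface LanguageModel { generate(embeds: Tensor, prompt: string): string }

function answer(
  encoder: Encoder,          // ~440M (vision) / ~460M (audio)
  projector: Projector,      // maps encoder features into the LM's embedding space
  lmWithLora: LanguageModel, // Phi-4-mini + the modality-specific LoRA adapter
  input: Uint8Array,
  prompt: string,
): string {
  const features = encoder.encode(input);     // raw image/audio -> features
  const embeds = projector.project(features); // features -> LM embedding space
  return lmWithLora.generate(embeds, prompt); // frozen LM + LoRA does the rest
}
```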
- This weekend, I found a very fun project from the community: TinyLM 🤏 Built around transformers.js 🚀, this project aims to give developers a straightforward API to work with LLMs. And most importantly, it can run inference in-browser using WebGPU or WASM 🚀 No server needed! Quick sketch below 👇
- Try it here: tinylm.wizenheimer.dev
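TinyLM has its own API (see its docs), but since it builds on transformers.js, the underlying in-browser flow looks roughly like this (the model here is just an example of mine, not TinyLM's default):

```typescript
import { pipeline } from "@huggingface/transformers"; // transformers.js v3

// Everything below runs in the browser: weights are fetched once, cached,
// and executed on WebGPU (with WASM as the fallback backend).
const generator = await pipeline(
  "text-generation",
  "HuggingFaceTB/SmolLM2-135M-Instruct", // example model, not TinyLM's default
  { device: "webgpu" },
);

const out: any = await generator(
  [{ role: "user", content: "Explain WebGPU in one sentence." }],
  { max_new_tokens: 64 },
);
// For chat-style input, generated_text is the conversation incl. the reply:
console.log(out[0].generated_text.at(-1).content);
```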