First project of 2025: Vision Transformer Explorer
I built a web app for interactively exploring the self-attention maps produced by ViTs. It reveals what the model focuses on when making predictions and provides insight into its inner workings! 🤯
Try it out yourself! 👇
The app loads a small DINOv2 model into the user's browser and runs it locally using Transformers.js! 🤗
This means you can analyze your own images for free: simply click the image to open the file dialog.
For example, the model recognizes that long necks and fluffy ears are defining features of llamas! 🦙
Vision Transformers work by dividing an image into fixed-size patches (e.g., 14 × 14 pixels), flattening each patch into a vector, and treating each patch as a token.
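To make the patch-to-token step concrete, here's a minimal NumPy sketch (not the app's code; the 224 × 224 image size and the `patchify` helper are illustrative assumptions, only the 14-pixel patch size comes from the post):

```python
import numpy as np

def patchify(image: np.ndarray, patch_size: int = 14) -> np.ndarray:
    """Split an (H, W, C) image into flattened patch tokens.

    Returns an array of shape (num_patches, patch_size * patch_size * C),
    i.e. one flattened vector ("token") per patch.
    """
    H, W, C = image.shape
    assert H % patch_size == 0 and W % patch_size == 0, "image must be divisible by patch size"
    # Reshape into a grid of patches, then flatten each patch into a vector.
    patches = image.reshape(
        H // patch_size, patch_size,
        W // patch_size, patch_size, C,
    )
    patches = patches.transpose(0, 2, 1, 3, 4)                # (grid_h, grid_w, p, p, C)
    return patches.reshape(-1, patch_size * patch_size * C)   # (grid_h * grid_w, p*p*C)

# Example: a 224x224 RGB image with 14x14 patches -> 16*16 = 256 tokens of length 588
image = np.random.rand(224, 224, 3).astype(np.float32)
tokens = patchify(image)
print(tokens.shape)  # (256, 588)
```

In an actual ViT, a learned linear projection (usually implemented as a strided convolution) then maps each flattened patch to the model's embedding dimension before the transformer layers see it.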
It's fascinating to see what each attention head learns to "focus on". For example, layer 11, head 1 seems to identify eyes. Spooky! 👀
Another interesting thing to see is how the attention maps become far more refined in later layers of the transformer (a short code sketch for extracting these maps follows below). For example:
First layer (1) – noisy and diffuse, capturing broad general patterns.
Last layer (12) – focused and precise, highlighting specific features.
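The app pulls these attention tensors out in the browser with Transformers.js, but the same idea is easy to sketch in Python with the transformers library. A minimal, hedged example: the facebook/dinov2-small checkpoint, the local image path, and the `cls_attention_map` helper are assumptions for illustration, not the demo's exact code.

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, AutoModel

# Checkpoint is illustrative; any DINOv2-style ViT checkpoint should work similarly.
checkpoint = "facebook/dinov2-small"
processor = AutoImageProcessor.from_pretrained(checkpoint)
# "eager" attention so that per-head attention weights can be returned.
model = AutoModel.from_pretrained(checkpoint, attn_implementation="eager")

image = Image.open("llama.jpg")  # path is illustrative: use any local image
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# outputs.attentions is a tuple with one tensor per layer,
# each of shape (batch, num_heads, seq_len, seq_len).
attentions = outputs.attentions
num_layers = len(attentions)        # 12 for the small model
num_heads = attentions[0].shape[1]  # 6 for the small model

def cls_attention_map(layer: int, head: int) -> torch.Tensor:
    """Attention from the [CLS] token to every image patch, as a 2D grid."""
    attn = attentions[layer][0, head]  # (seq_len, seq_len)
    cls_to_patches = attn[0, 1:]       # drop the [CLS] -> [CLS] entry
    side = int(cls_to_patches.numel() ** 0.5)
    return cls_to_patches[: side * side].reshape(side, side)

early = cls_attention_map(layer=0, head=0)               # noisy and diffuse
late = cls_attention_map(layer=num_layers - 1, head=0)   # sharp and object-focused
eyes = cls_attention_map(layer=10, head=0)               # "layer 11, head 1" if counting from 1
```

Plotting `early` next to `late` (e.g., with matplotlib's `imshow`) reproduces the noisy-to-focused progression described above.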
This project was greatly inspired by Brendan Bycroft's amazing LLM Visualization tool – check it out if you haven't already! Also, thanks to Niels Rogge for adding DINOv2 w/ Registers to transformers! 🤗
Source code:
github.com/huggingface/...
Online demo:
huggingface.co/spaces/webml...
Jan 1, 2025 15:37