The model is multilingual (English, Chinese, Korean & Japanese) and even supports zero-shot voice cloning! 🤯 Stay tuned for an update that will add these features to the UI!
More samples:
bsky.app/profile/reac...Smol TTS keeps getting better! Introducing OuteTTS v0.2 - 500M parameters, multilingual with voice cloning! 🔥
> Multilingual - English, Chinese, Korean & Japanese
> Cross platform inference w/ llama.cpp
> Trained on 5 Billion audio tokens
> Qwen 2.5 0.5B LLM backbone
> Trained via HF GPU grants