Qwen3 TTS is seriously underrated - I got it running locally in real time and it's one of the most expressive open TTS models I've tried
The author revisited an old project, a real-time, local ASR->LLM->TTS pipeline, and was pleasantly surprised by Qwen3 TTS. After significant experimentation, they got Qwen3 TTS working reliably for local streaming, and they praise both its expressiveness and an architecture that suits this kind of streaming use.
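
To make the ASR->LLM->TTS shape concrete, here is a minimal sketch of how such a streaming pipeline might be wired together with Python generators. It is not the author's code and does not call Qwen3 TTS or any real model: `transcribe`, `generate_tokens`, `synthesize`, and `run_pipeline` are hypothetical stand-ins, and splitting on sentence-ending punctuation is just one assumed way to decide when to hand text to the TTS stage.

```python
from typing import Iterable, Iterator

# All stage functions below are hypothetical stand-ins, not real model APIs.


def transcribe(audio: bytes) -> str:
    """Stand-in ASR: pretend the audio bytes are UTF-8 text."""
    return audio.decode("utf-8")


def generate_tokens(prompt: str) -> Iterator[str]:
    """Stand-in LLM: stream a canned reply one word at a time."""
    reply = f"You said: {prompt}. That is interesting! Tell me more."
    for word in reply.split(" "):
        yield word + " "


def split_sentences(tokens: Iterable[str]) -> Iterator[str]:
    """Buffer streamed tokens and emit each sentence as soon as it ends."""
    buffer = ""
    for token in tokens:
        buffer += token
        if buffer.rstrip().endswith((".", "!", "?")):
            yield buffer.strip()
            buffer = ""
    if buffer.strip():
        # Flush any trailing text that did not end with punctuation.
        yield buffer.strip()


def synthesize(sentence: str) -> bytes:
    """Stand-in TTS: return placeholder bytes instead of real audio."""
    return sentence.encode("utf-8")


def run_pipeline(audio: bytes) -> Iterator[bytes]:
    """Chain ASR -> LLM -> TTS lazily, yielding one chunk per sentence."""
    text = transcribe(audio)
    for sentence in split_sentences(generate_tokens(text)):
        yield synthesize(sentence)


if __name__ == "__main__":
    for chunk in run_pipeline(b"hello"):
        print(chunk)
```

Because every stage is a generator, the first chunk is produced as soon as the first sentence is complete rather than after the whole reply has been generated, which is the property that matters for perceived latency in a real-time setup.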
