
audio

4 items

NEWS · trending · Reddit r/LocalLLaMA · 4/12/2026

mtmd: qwen3 audio support (qwen3-omni and qwen3-asr)

mtmd now supports audio input for Qwen3 through two model variants: `qwen3-omni-moe` (multimodal, with vision and audio input) and `qwen3-asr` (automatic speech recognition). GGUF models for Qwen3-Omni (30B variants) and Qwen3-ASR (1.7B and 0.6B) are available on Hugging Face.

42
ARTICLE · DEV.to AI · 25d ago

The AI Voiceover That Doesn't Sound Like a Robot

How to create engaging AI voiceovers that do not sound robotic, with an emphasis on integrating the voice with the visuals. The article highlights tools like ElevenLabs and the use of SSML (Speech Synthesis Markup Language) for precise control over pacing, tone, and emphasis, and treats the voice as the director of the visual content.

26
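As a rough illustration of the SSML control mentioned in the voiceover summary, the sketch below builds a small SSML document with a slowed intro, a pause, and an emphasized phrase. It is not taken from the article: the helper name `build_ssml` and its parameters are hypothetical, and it uses only standard SSML elements (`speak`, `prosody`, `break`, `emphasis`). Which of these tags a given engine honors varies, so check the provider's documentation before relying on them.

```python
import xml.etree.ElementTree as ET


def build_ssml(intro: str, key_phrase: str, outro: str, pause_ms: int = 400) -> str:
    """Hypothetical helper: return an SSML string with pacing and emphasis cues."""
    speak = ET.Element("speak")

    # Pacing: read the intro at a slower rate.
    prosody = ET.SubElement(speak, "prosody", rate="slow")
    prosody.text = intro

    # Pause before the key phrase, e.g. to line up with a visual cut.
    ET.SubElement(speak, "break", time=f"{pause_ms}ms")

    # Emphasis: stress the phrase the visuals should land on.
    emphasis = ET.SubElement(speak, "emphasis", level="strong")
    emphasis.text = key_phrase
    emphasis.tail = " " + outro  # plain text that follows the emphasized phrase

    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("Meet the product.", "It just works.", "See for yourself."))
```

Building the markup with an XML library rather than string concatenation keeps the output well-formed and escapes characters such as `&` in the script text.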