
Speech-to-Text

44 items

ARTICLE · DEV.to AI · 4/15/2026

voice- Agent model

This article walks through building a responsive voice-controlled AI agent that understands context and performs complex technical tasks. It outlines the architecture, which uses the Groq LPU Inference Engine and Whisper Large V3 for ultra-fast speech-to-text transcription.

ARTICLE · DEV.to AI · 7d ago

Transcription accuracy vs. transcription quality: why the gap matters

This article distinguishes transcription accuracy, usually measured by Word Error Rate (WER), from perceived transcription quality. It argues that WER counts word-level errors but does not capture user satisfaction, which depends heavily on speaker labeling, formatting, and punctuation; the result is a "perceived quality gap."
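
For readers unfamiliar with the metric, here is a minimal sketch of how WER is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the summarized article does not specify an implementation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref = reference.split()  # assumption: plain whitespace tokenization
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of four reference words.
print(word_error_rate("the quick brown fox", "the quick brown box"))
```

Because this sketch compares bare word tokens, it says nothing about speaker labels or layout, which is the kind of gap the article describes.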

ARTICLE · DEV.to AI · 5/7/2026

Why I switched from Dragon NaturallySpeaking to Whisper API (and built my own app)

The author explains why they switched from Dragon NaturallySpeaking to the Whisper API for speech-to-text, despite Dragon's long-standing reputation. The post aims to help others evaluate modern speech-to-text options for professional use, and it details Dragon's strengths: on-device processing, commands, long-session accuracy, and Windows integration.

ARTICLE · DEV.to AI · 4/15/2026

Aisha AI: Complete Resource Guide — 100 Official Links for Uzbekistan's Leading AI Platform

This guide introduces Aisha AI, Central Asia's fastest-growing AI platform, which specializes in Uzbek-language speech synthesis, speech-to-text, chatbots, and voice agents. It provides 100 official links covering products, documentation, and applications across various sectors, driving digital transformation in the region.
