voice AI

46 items

ARTICLE · DEV.to AI · 4/15/2026

Building Mini Gravity: A Local, Private Voice AI Agent

This article introduces Mini Gravity, a local, private voice AI agent designed to run entirely on the user's machine and able to handle documents and generate code. It describes a three-layer architecture (STT, Intent, Execution) built on technologies such as Groq's Whisper and DeepSeek-Coder, and stresses the importance of robust logic and prompt engineering.

59
ARTICLE · DEV.to AI · 4/21/2026

Building a Voice-First AI Tutor: Why Real-Time Audio Processing Changes Everything

This article explains how building a voice-first AI tutor such as Ivy, designed for Ethiopian students, differs from building a traditional text-based chatbot. The main challenges it covers are real-time audio processing, natural conversation flow, multi-language support (Amharic), low latency, and offline capabilities.

35
ARTICLE · DEV.to AI · 2d ago

I Built Yumii — An Open-Source AI Companion

Yumii is an open-source, locally run AI companion with a Live2D avatar, real-time voice, six personalities, and now long-term memory across sessions. Developed for the GitHub Finish-Up-A-Thon Challenge, it prioritizes accessibility through easy installation, comprehensive documentation, and a streamlined onboarding process.

33
DOC · DEV.to AI · 4/16/2026

Voice Agent

This document describes a voice-controlled local AI agent that processes audio input, identifies user intent, executes actions, and displays the results in a user interface. The system is built as a modular pipeline from audio input to UI output, with scalability and flexibility as design goals.

31
ARTICLE · DEV.to AI · 4/13/2026

Building a Voice-Controlled AI Agent with FastAPI, Groq & Streamlit

This article describes a voice-controlled AI agent that lets users direct their computer by voice for tasks such as creating files or writing code. It covers the system's architecture, model selection, and a three-stage process that uses Groq Whisper for STT and Groq Llama-3.1-8b for intent classification, all built with FastAPI and Streamlit.

28