Blog

Blog Thumbnail
iOS Real-Time Noise Suppression SDK: Swift Tutorial
January 21, 2026 · 3 min read

Integrate real-time noise suppression into your iOS app with the Koala iOS SDK. On-device noise supression for real-time communication apps.

Blog Thumbnail
ML Kit Android Speech-to-Speech Translation: Complete Kotlin Tutorial
January 21, 2026 · 5 min read

Build ML Kit Android speech-to-speech translation with Kotlin. Complete guide using Cheetah STT, Google ML Kit Translation, and Orca TTS for on-device voice translation.

Blog Thumbnail
Build a Real-Time Meeting Summarization Tool in Python
January 19, 2026 · 2 min read

Complete guide to building a real-time meeting summarization tool in Python with streaming speech-to-text and AI summaries. Full code included.

Blog Thumbnail
Build an AI Voice Note-Taking App with Python
January 19, 2026 · 2 min read

Complete guide to building a voice note-taking app in Python with wake word activation, stop phrase control, and on-device transcription. Full code included.

Blog Thumbnail
How to Play Audio in Python: PCM Audio Playback Guide
January 19, 2026 · 4 min read

Learn how to play audio in Python with PvSpeaker. Stream PCM audio output for text-to-speech, audio synthesis, and real-time audio playback on Windows, macOS, and Linux.

Blog Thumbnail
How to Record Audio in React Native for iOS and Android
January 19, 2026 · 3 min read

Learn how to record audio in React Native apps for Android and iOS. Capture PCM microphone input for speech recognition, voice commands, and real-time audio processing.

Blog Thumbnail
Automatic Punctuation and Truecasing with Python Speech-to-Text
January 16, 2026 · 3 min read

Learn how to enable automatic punctuation and correct casing in speech-to-text with Python. Get formatted transcripts with periods, commas, and capitalization.

Blog Thumbnail
Complete Guide to Building HIPAA-Compliant Medical Voice AI Agent
January 16, 2026 · 4 min read

Build HIPAA-compliant medical voice agent in Python with on-device speech processing. Complete tutorial with wake word detection, real-time STT, and TTS.

Blog Thumbnail
Run Local Large Language Models in C: Cross-Platform LLM Inference
December 23, 2025 · 5 min read

Learn how to run LLMs locally in C across Linux, Windows, macOS, and Raspberry Pi with streaming text generation.

Blog Thumbnail
Step-by-Step Guide: Add Speaker Diarization to OpenAI Whisper in C++
December 19, 2025 · 5 min read

Step-by-step guide to adding speaker diarization to OpenAI Whisper STT in C++ using Falcon Speaker Diarization for multi-speaker transcription.

Blog Thumbnail
Complete Tutorial: Voice Activity Detection in C
December 18, 2025 · 3 min read

Voice Activity Detection (VAD) is a core building block for speech and audio systems, used to determine when human speech is present in an audio stream.

Blog Thumbnail
How to Implement Cross-Platform Noise Suppression in C
December 18, 2025 · 4 min read

Learn how to implement real-time noise cancellation in C across Linux, Windows, macOS, and Raspberry Pi.

Blog Thumbnail
Complete MCP Tutorial: How to Build a Local MCP Voice Assistant in Python
December 17, 2025 · 6 min read

Learn how to build a local MCP voice assistant using a local LLM to handle function calling, speech-to-text, text-to-speech, and external API integration in this step-by-step MCP tutorial.

Blog Thumbnail
Build Voice Banking Assistant with Python SDK
December 16, 2025 · 3 min read

Build a banking voice AI agent with custom wake words and voice activated banking features for secure and compliant financial applications.

Blog Thumbnail
Get Word-Level Confidence in Speech-to-Text with Python
December 16, 2025 · 2 min read

Learn how to get word-level confidence scores in Python for speech-to-text. Set word confidence thresholds to improve transcription quality.

Blog Thumbnail
How to Build Cross-Platform Speaker Recognition in C: Complete Tutorial
December 15, 2025 · 6 min read

Step-by-step tutorial: Build cross-platform speaker recognition in C using Picovoice Eagle. Includes complete code for speaker enrollment & recognition on Linux, Windows, macOS, and Raspberry Pi.