Blog

Blog Thumbnail
Text-to-Speech in Python: On-Device Solutions
September 21, 2023 · 1 min read

Learn about on-device Text-to-Speech in Python. Explore three Text-to-Speech frameworks in action to synthesize speech directly on your devi...

Blog Thumbnail
Understanding LLM.int8()
September 21, 2023 · 2 min read

LLMs are highly useful, yet their runtime requirements are eye-watering. Learn how LLM.int8() quantizes LLMs and reduces their memory requir...

Blog Thumbnail
iOS Speech Recognition
September 20, 2023 · 3 min read

Learn how to perform Speech Recognition in iOS, including Speech-to-Text, Voice Commands, Wake Word Detection, and Voice Activity Detection....

Blog Thumbnail
iOS Speech to Text
September 17, 2023 · 1 min read

Learn how to transcribe speech to text on an iOS device. Picovoice Leopard and Cheetah Speech-to-Text SDKs run on mobile, desktop, and embed...

Blog Thumbnail
Noise Suppression in Android
September 15, 2023 · 1 min read

Learn how to use Koala noise suppression engine to mute unwanted background noise across different Android devices....

Blog Thumbnail
Real-Time Transcription in Node.js
September 14, 2023 · 2 min read

Transcribe speech to text in real time with Cheetah Streaming Speech-to-Text. Cheetah Node.js SDK runs on Linux, macOS, Windows, Raspberry P...

Blog Thumbnail
Speaker Diarization in Python
September 13, 2023 · 1 min read

Learn about speaker diarization in Python. Explore speaker diarization frameworks in action, unraveling their potential with a simple task....

Blog Thumbnail
Enhance Audio with AI-Powered Noise Suppression
September 12, 2023 · 1 min read

Koala Noise Suppression has become the developer’s choice, especially for real-time audio enhancement....

Blog Thumbnail
DaVinci - ChatGPT AI Virtual Assistant in Python
September 12, 2023 · 2 min read

ChatGPT has become one of the most popular AI algorithms since its release in November 2022. Developers and enterprises immediately started ...

Blog Thumbnail
Understanding Differences Among CPU vs. GPU vs. TPU vs. NPU
September 12, 2023 · 2 min read

A decade ago, popular processing units were Central Processing Units (CPUs) and Graphics Processing Units (GPUs). Advances in artificial int...

Blog Thumbnail
Choosing the Best Noise Cancellation: NVIDIA RTX Voice, Krisp, or Your Own?
September 12, 2023 · 2 min read

With the increasing prevalence of audio and video communications, the need for enhancing audio quality and removing background noise also ha...

Blog Thumbnail
Prompt Engineering
September 12, 2023 · 1 min read

Prompt Engineering has emerged as a crucial technique to maximize the effectiveness of AI models. Prompt Engineering has become popular with...

Blog Thumbnail
Top Free and Commercial Speaker Diarization APIs and SDKs
September 12, 2023 · 2 min read

The industry practice is to embed Speaker Diarization into commercial speech-to-text systems. Developers generally use the default Speaker D...

Blog Thumbnail
Understanding User Interfaces: Choosing the Right Interface for Your Application
September 12, 2023 · 2 min read

User Interfaces determine the success of software. Several factors, such as ease of use, efficiency, accessibility, and aesthetics, affect h...

Blog Thumbnail
Verbatim Transcription: Use Cases & Benefits
September 12, 2023 · 1 min read

Verbatim Transcriptions have become indispensable across various industries and niches, leaving product teams to struggle to choose Verbatim...

Blog Thumbnail
Voice Prompts vs. Text Prompts
September 12, 2023 · 1 min read

A Prompt is an input that guides the model to generate the desired output. As the term Prompt has become popular with large language models,...