Blog

Audio Sampling and Sample Rate
April 9, 2024 · 2 min read

Audio sampling, or sampling, refers to the process of converting a continuous analog audio signal into a discrete digital signal....

Choosing the Best Text-to-Speech: A Comprehensive Guide
April 9, 2024 · 2 min read

Choosing the best Text-to-Speech (TTS) depends on your requirements....

JavaScript Speech Recognition
April 8, 2024 · 2 min read

Learn how to perform Speech Recognition in JavaScript, including Speech-to-Text, Voice Commands, Wake Word Detection, and Voice Activity Det...

Text-to-Speech using JavaScript
April 8, 2024 · 1 min read

Synthesize text to speech using Picovoice Orca Text-to-Speech Web SDK. The SDK runs on all modern web browsers....

Android Speaker Diarization
March 26, 2024 · 1 min read

Learn how to perform Speaker Diarization in Android. Picovoice Falcon Speaker Diarization SDK runs on mobile, desktop, and embedded platform...

5 Reasons Why Working at a Startup is (not) a Great Idea
March 22, 2024 · 5 min read

Some people know they are born to start their own business, work at a large organization, or work at a startup. Yet, many candidates prefer ...

Difference Between Speaker Diarization and Speaker Identification
March 22, 2024 · 1 min read

The information captured from voice data is not limited to transcription of uttered words. Our voices capture information about our age, gen...

Whisper Speech-to-Text Alternative for Real-time Transcription
March 22, 2024 · 2 min read

Some startups, like Deepgram, have started offering hosted Whisper along with their own speech-to-text offerings, and some startups and open...

Real-time Speaker Identification with Node.js
March 14, 2024 · 2 min read

Learn how to easily create a speaker identification app using Picovoice's Eagle Node.js SDK. On-device speaker identification with cloud-lev...

Lumina - AI Art Generator using Voice Prompts in Python
March 4, 2024 · 3 min read

Lumina, powered by OpenAI DALL-E 3 and Picovoice’s wake word, voice activity detection, speech-to-text, and audio recorders, is an AI Art G...

Picovoice Interviews: What’s the ROC Curve?
March 4, 2024 · 5 min read

Picovoice serves ML researchers and developers with its resource-efficient AI models and engines. Thus, we ask take-home questions related t...

State of Generative AI for Audio in 2024
March 4, 2024 · 2 min read

Generative AI for Audio has made significant progress in recent years, enabling the creation of high-quality audio content....

Speaker Diarization for Web Applications using JavaScript
March 1, 2024 · 1 min read

Learn how to easily create a speaker diarization app using Picovoice's Falcon Web SDK. The SDK runs on all modern web browsers....

Speaker Recognition for Web Applications using JavaScript
March 1, 2024 · 2 min read

Learn how to easily create a speaker recognition app using Picovoice's Eagle Web SDK. The SDK runs on all modern web browsers....

Voice Activity Detection in Node.js
March 1, 2024 · 2 min read

Detect voice activity in real time with Cobra Voice Activity Detection engine. Cobra Node.js SDK runs on Linux, macOS, Windows, Raspberry Pi...

Orca Text-to-Speech on Raspberry Pi
February 23, 2024 · 1 min read

Run a neural Text-to-Speech engine on Raspberry Pi using Orca Text-to-Speech. Build offline voice interfaces and applications....