Blog

Blog Thumbnail
Speaker Diarization for Web Applications using JavaScript
March 1, 2024 · 1 min read

Learn how to easily create a speaker diarization app using Picovoice's Falcon Web SDK. The SDK runs on all modern web browsers....

Blog Thumbnail
Speaker Recognition for Web Applications using JavaScript
March 1, 2024 · 2 min read

Learn how to easily create a speaker recognition app using Picovoice's Eagle Web SDK. The SDK runs on all modern web browsers....

Blog Thumbnail
Voice Activity Detection in Node.js
March 1, 2024 · 2 min read

Detect voice activity in real time with Cobra Voice Activity Detection engine. Cobra Node.js SDK runs on Linux, macOS, Windows, Raspberry Pi...

Blog Thumbnail
Orca Text-to-Speech on Raspberry Pi
February 23, 2024 · 1 min read

Run a neural Text-to-Speech engine on Raspberry Pi using Orca Text-to-Speech. Build offline voice interfaces and applications....

Blog Thumbnail
Meet Orca Text-to-Speech: On-device Voice Generator
February 12, 2024 · 2 min read

Private, Efficient, Fast, Ready-to-Use Text-to-Speech: Orca Text-to-Speech, the on-device voice generator that converts written text into sp...

Blog Thumbnail
Eagle: Speaker Recognition and Identification for Developers
February 8, 2024 · 1 min read

Today, Picovoice is pleased to announce the public beta release of its Speaker Recognition engine, Eagle....

Blog Thumbnail
Transcribe and Summarize YouTube videos using Twilio, ChatGPT, and Leopard Speech-to-Text in Node.JS
January 17, 2024 · 1 min read

Have you ever thought of getting a summary of a YouTube video by sending a WhatsApp message? Ezzeddin Abdullah built an application that tra...

Blog Thumbnail
Adding Speaker Diarization to OpenAI Whisper using Picovoice Falcon
January 15, 2024 · 1 min read

Learn how to integrate speaker diarization into OpenAI Whisper Speech-to-Text using Picovoice Falcon in Python...

Blog Thumbnail
State of Speaker Diarization in 2023
December 18, 2023 · 1 min read

Speaker Diarization is figuring out “who spoke when?”. In practice, it’s about “who spoke when and what?”. Let me expand....

Blog Thumbnail
Top Free and Commercial Speaker Diarization APIs and SDKs
December 18, 2023 · 2 min read

The industry practice is to embed Speaker Diarization into commercial speech-to-text systems. Developers generally use the default Speaker D...

Blog Thumbnail
Offline Speech-to-Text Features
December 12, 2023 · 2 min read

The launch of Leopard Speech-to-Text and Cheetah Speech-to-Text for streaming brought cloud-level automatic speech recognition (ASR) to loca...

Blog Thumbnail
Picovoice On-Device Voice AI Platform in 2023
December 12, 2023 · 1 min read

2023 was another busy year at Picovoice with new products, languages, SDKs, team members and customers....

Blog Thumbnail
Falcon: Speaker Diarization for Developers
December 4, 2023 · 2 min read

Falcon Speaker Diarization is 100x more efficient than pyannote Speaker Diarization and diarizes speakers 5x more accurately than Google Spe...

Blog Thumbnail
Real-time Transcription with React.js
November 15, 2023 · 1 min read

Transcribe speech-to-text in real-time using Picovoice Cheetah Streaming Speech-to-Text React.js SDK. The SDK runs on Linux, macOS, Windows,...

Blog Thumbnail
Speech-to-Text with React.js
November 10, 2023 · 1 min read

Transcribe speech to text using Picovoice Leopard speech-to-text React.js SDK. The SDK runs on Linux, macOS, Windows, Raspberry Pi, and NVID...

Blog Thumbnail
AI Overfitting and Underfitting for Executives
November 6, 2023 · 2 min read

Overfitting and Underfitting are modeling errors in statistics. Artificial Intelligence algorithms are mathematical prediction models....