Introduction

Picovoice is a developer-first platform for building voice products on your terms. Unlike the cloud services from Amazon or Google, Picovoice’s AI runs entirely on-device, and Picovoice reports higher accuracy in its open-source benchmarks. You can use Picovoice for keyword spotting, voice commands, voice user interfaces (VUI), phonetic search, automatic speech recognition (ASR), speech-to-text (STT), voice activity detection (VAD), noise suppression, and speaker recognition.

Free Plan

Start building for free. No credit card is required!

Platform Features

  • Accurate: Resilient to noise and reverberation. Picovoice outperforms cloud-based alternatives and backs its claims with open-source benchmarks:
    • Wake Word Benchmark (KWS & hotword)
    • Speech-to-Intent Benchmark (VUI & NLU)
    • Speech-to-Text Benchmark (ASR & STT)
    • Speech-to-Index Benchmark (phonetic search)
    • Voice Activity Detection Benchmark (VAD)
    • Noise Suppression Benchmark (Speech Enhancement)
  • Private & Secure: Voice recognition is entirely offline. Intrinsically private; HIPAA and GDPR-compliant.
  • Zero-Latency: Edge-first architecture eliminates unpredictable network delay.
  • Self-Service: Design, train, and test voice interfaces instantly in your browser, using Picovoice Console.
  • Cross-Platform: Design once, deploy anywhere. Build using familiar languages and frameworks.

Flow

Design

Picovoice Console is the cloud-based platform for training custom speech models, all within your web browser. No machine learning skills are required. Describe what you need with text and export trained models. Picovoice Console supports training branded wake words, context-aware voice commands, and custom ASR models.

Develop

Models exported from Picovoice Console run on Picovoice SDKs, which are available for many platforms and support most frameworks. Together, Picovoice Console and the SDKs let you design, build, and iterate fast.

Deploy

Start for free with the Picovoice Free Plan, even for commercial use. Deploy without having to maintain complex cloud infrastructure. Avoid the hefty API-based pricing, limitations, and control imposed by big tech.

Getting Started

Speech-to-Text

Picovoice Leopard Speech-to-Text is the only on-device ASR engine that matches or exceeds cloud-level accuracy while being under 20 MB in size. You can use Picovoice Console to train models that understand the custom vocabulary that matters to your use case.

Streaming Speech-to-Text

Picovoice Cheetah Streaming Speech-to-Text is the only on-device real-time ASR engine that matches or exceeds cloud-level accuracy while being under 20 MB in size.

Phonetic Search

Picovoice Octopus Speech-to-Index directly searches speech data without relying on a text representation. It outperforms cloud-based speech-to-text engines on search tasks by removing the out-of-vocabulary and competing hypothesis problems.

Noise Suppression

Picovoice Koala Noise Suppression is the only high-quality and cross-platform on-device noise suppression engine.

Speaker Recognition

Picovoice Eagle Speaker Recognition is the only enterprise-grade and freely available speaker recognition engine.

Wake Word

The Picovoice Porcupine Wake Word engine detects utterances of given wake phrases. You can train custom wake words using Picovoice Console and then run the exported wake word models on the Picovoice SDK.
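
As a rough illustration of how an application might feed audio to a wake word engine, the sketch below splits a buffer of 16-bit PCM samples into fixed-size frames and passes each frame to a detector. The `StubWakeWordEngine` class, its `frame_length` attribute, and its `process` method are hypothetical stand-ins written for this example. They are not the Picovoice SDK API, and the amplitude check is only a placeholder for a trained model; a real integration should follow the SDK documentation for your platform.

```python
from typing import Iterator, List, Sequence


def frames(pcm: Sequence[int], frame_length: int) -> Iterator[Sequence[int]]:
    """Yield consecutive full frames; a trailing partial frame is dropped."""
    for start in range(0, len(pcm) - frame_length + 1, frame_length):
        yield pcm[start:start + frame_length]


class StubWakeWordEngine:
    """Hypothetical stand-in: reports a detection when a frame's peak is high."""

    def __init__(self, frame_length: int = 4, peak: int = 1000) -> None:
        self.frame_length = frame_length
        self._peak = peak

    def process(self, frame: Sequence[int]) -> bool:
        if len(frame) != self.frame_length:
            raise ValueError("frame has the wrong number of samples")
        return max(abs(sample) for sample in frame) >= self._peak


def detect(engine: StubWakeWordEngine, pcm: Sequence[int]) -> List[int]:
    """Return the indices of frames in which the engine reports a detection."""
    return [
        index
        for index, frame in enumerate(frames(pcm, engine.frame_length))
        if engine.process(frame)
    ]


engine = StubWakeWordEngine()
audio = [0, 5, -3, 2, 10, 1500, -20, 4, 1, 1, 1, 1, 7, 7]
hits = detect(engine, audio)
print(hits)
```

The point of the sketch is the shape of the loop: audio arrives as a stream, the engine consumes it one fixed-size frame at a time, and the application reacts as soon as a frame triggers a detection.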

Voice Commands

The Picovoice Rhino Speech-to-Intent engine directly infers the user's intent from spoken commands within a given domain of interest (context). You can design and train custom contexts for your product using Picovoice Console.
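
To illustrate what inferring intent within a context can look like on the application side, the sketch below models an inference result as an intent name plus slot values and routes it to a handler. The `Inference` type, the `setTemperature` intent, and its `room` and `degrees` slots are all hypothetical examples chosen for this sketch, not output of the Rhino SDK.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class Inference:
    """Hypothetical result shape: whether the command was understood, and how."""
    is_understood: bool
    intent: Optional[str] = None
    slots: Dict[str, str] = field(default_factory=dict)


def set_temperature(slots: Dict[str, str]) -> str:
    # 'room' is treated as optional here, 'degrees' as required.
    return f"Setting {slots.get('room', 'home')} to {slots['degrees']} degrees"


# Map each intent name in the (hypothetical) context to a handler function.
HANDLERS: Dict[str, Callable[[Dict[str, str]], str]] = {
    "setTemperature": set_temperature,
}


def handle(inference: Inference) -> str:
    """Route an understood command to its handler; otherwise ask for a retry."""
    if not inference.is_understood or inference.intent not in HANDLERS:
        return "Sorry, I didn't understand that."
    return HANDLERS[inference.intent](inference.slots)


result = handle(
    Inference(True, "setTemperature", {"room": "kitchen", "degrees": "21"})
)
print(result)
```

Because the engine is restricted to one context, the application only needs handlers for the intents that context defines, and anything outside it falls through to the retry path.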

Voice Activity Detection

The Picovoice Cobra Voice Activity Detector engine detects the presence of speech in a stream of audio in real time.
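
One common way to use a voice activity detector is to turn per-frame scores into speech segments. The sketch below assumes the detector yields one voice probability between 0 and 1 per audio frame. That output format, the 0.5 threshold, and the 32 ms default frame duration are assumptions made for illustration, not values stated on this page.

```python
from typing import Iterable, List, Optional, Tuple


def speech_segments(
    probabilities: Iterable[float],
    threshold: float = 0.5,
    frame_seconds: float = 0.032,
) -> List[Tuple[float, float]]:
    """Group consecutive frames at or above the threshold into (start, end) times."""
    segments: List[Tuple[float, float]] = []
    start: Optional[int] = None  # index of the first frame in the open segment
    count = 0                    # number of frames seen so far
    for index, probability in enumerate(probabilities):
        count = index + 1
        if probability >= threshold and start is None:
            start = index
        elif probability < threshold and start is not None:
            segments.append((start * frame_seconds, index * frame_seconds))
            start = None
    if start is not None:
        # The stream ended while speech was still active; close the segment.
        segments.append((start * frame_seconds, count * frame_seconds))
    return segments


# One-second frames keep the arithmetic easy to follow.
print(speech_segments([0.1, 0.8, 0.9, 0.2, 0.7], frame_seconds=1.0))
# prints [(1.0, 3.0), (4.0, 5.0)]
```

A fixed threshold is the simplest choice; an application that needs fewer spurious segment boundaries might add smoothing or a minimum segment duration on top of it.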

Technical Support

GitHub issues are the only free support channel we offer. You can also purchase an Enterprise Support Add-on for dedicated support.

Commercial Inquiries

Please refer to our pricing page to learn about our business model.

© 2019-2022 Picovoice Inc.