Learn how to detect voice activity in audio data using the Picovoice Cobra Voice Activity Detection (VAD) SDK.
The SDK runs on Linux, macOS, Windows, Raspberry Pi, NVIDIA Jetson, and BeagleBone.
Install VAD SDK
Sign up for Picovoice Console
Log in to (or sign up for) Picovoice Console. It is free, and no credit card is required.
Copy your AccessKey to the clipboard.
Implement in Python
Import Cobra's Python package and create an instance of the VAD engine with your AccessKey:
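A minimal sketch of this step, assuming the package is installed under the name pvcobra and exposes a create(access_key=...) factory. The wrapper create_cobra and its empty-key check are illustrative additions, not part of the SDK, and the AccessKey shown in the usage comment is a placeholder.

```python
def create_cobra(access_key):
    """Create a Cobra VAD engine from a Picovoice AccessKey.

    Assumes the `pvcobra` package is installed. `create_cobra` is a
    hypothetical helper written for this example, not an SDK function.
    """
    if not access_key or not access_key.strip():
        raise ValueError("AccessKey must be a non-empty string")
    # Imported lazily so a missing package only fails when an engine is requested.
    import pvcobra
    return pvcobra.create(access_key=access_key)


# Usage (placeholder key; replace with the AccessKey copied from Picovoice Console):
# handle = create_cobra("${ACCESS_KEY}")
# ...
# handle.delete()  # release native resources when finished
```

Calling handle.delete() once the engine is no longer needed releases the resources held by the underlying native library.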
Once initialized, the engine reports the sample rate it requires as handle.sample_rate. The expected frame length (the number of audio samples in an input array) is handle.frame_length. The engine accepts 16-bit linearly-encoded PCM and operates on single-channel audio.
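The framing requirement above can be sketched as follows. This example assumes the audio is already mono, little-endian 16-bit PCM at handle.sample_rate, and that the engine exposes a process(frame) method returning a voice probability for each frame. The helpers bytes_to_samples, iter_frames, and voice_probabilities are hypothetical names introduced here, and dropping a trailing partial frame is a choice made for this sketch.

```python
import struct


def bytes_to_samples(pcm_bytes):
    """Unpack little-endian 16-bit PCM bytes into a list of signed ints."""
    count = len(pcm_bytes) // 2
    return list(struct.unpack("<%dh" % count, pcm_bytes[:count * 2]))


def iter_frames(samples, frame_length):
    """Yield consecutive full frames; a trailing partial frame is dropped."""
    for start in range(0, len(samples) - frame_length + 1, frame_length):
        yield samples[start:start + frame_length]


def voice_probabilities(engine, samples):
    """Feed each full frame to the engine and collect its per-frame results.

    `engine` is expected to provide `frame_length` and `process(frame)`,
    as the Cobra handle is assumed to do.
    """
    return [engine.process(frame)
            for frame in iter_frames(samples, engine.frame_length)]
```

With a real engine, voice_probabilities(handle, samples) would return one value per frame, which can then be compared against a threshold chosen for the application.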
Below is an example output of Cobra on a test audio file:
