Learn how to detect voice activity in audio data using Picovoice Cobra Voice Activity Detection (VAD) SDK. The SDK runs on Linux, macOS, Windows, Raspberry Pi, NVIDIA Jetson, and BeagleBone.

Install VAD SDK

Sign up for Picovoice Console

Log in to (sign up for) Picovoice Console. It is free, and no credit card is required! Copy your AccessKey to the clipboard.

Implement in Python

Import Cobra's Python package and create an instance of the VAD engine with your AccessKey:

When initialized, the valid sample rate is given by handle.sample_rate. The expected frame length (number of audio samples in an input array) is handle.frame_length. The engine accepts 16-bit linearly-encoded PCM and operates on single-channel audio.

Below is an example output of Cobra on a test audio file:

Cobra VAD output example