Go Quick Start
- Linux (x86_64)
- macOS (x86_64, arm64)
- Windows (x86_64)
- NVIDIA Jetson Nano
- Raspberry Pi (3, 4)
Picovoice Account & AccessKey
Signup or Login to Picovoice Console to get your
Make sure to keep your
Create an instance of the Leopard engine:
Transcribe an audio file:
When done be sure to explicitly release the resources using
The Leopard Go SDK comes preloaded with a default English language model (
Default models for other supported languages can be found in the Leopard GitHub repository.
Create custom language models using the Picovoice Console. Here you can train language models with custom vocabulary and boost words in the existing vocabulary.
Pass in the
.pv file by setting
.ModelPath on an instance of Leopard before initializing:
Along with the transcript, Leopard returns metadata for each transcribed word. Available metadata items are:
- Start Time: Indicates when the word started in the transcribed audio. Value is in seconds.
- End Time: Indicates when the word ended in the transcribed audio. Value is in seconds.
- Confidence: Leopard's confidence that the transcribed word is accurate. It is a number within
- Speaker Tag: If speaker diarization is enabled on initialization, the speaker tag is a non-negative integer identifying unique speakers, with
0reserved for unknown speakers. If speaker diarization is not enabled, the value will always be
For the Leopard Go SDK, we offer demo applications that demonstrate how to use the Speech-to-Text engine on audio files.
Clone the Leopard repository from GitHub using HTTPS:
To see the usage options for the demos, use the
Run the following command to transcribe an audio file:
For more information on our Leopard demos for Go, head over to our GitHub repository.