Cheetah Speech-to-Text
Web Quick Start
Platforms
- Chrome & Chromium-based browsers
- Edge
- Firefox
- Safari
Requirements
- Picovoice Account and AccessKey
- Node.js 16+
- npm
Picovoice Account & AccessKey
Sign up or log in to Picovoice Console to get your AccessKey.
Make sure to keep your AccessKey secret.
Quick Start
Setup
Install Node.js.
Install the Web Voice Processor and Cheetah Streaming Speech-to-Text Web packages from npm (@picovoice/web-voice-processor and @picovoice/cheetah-web).
Usage
Generate a custom Cheetah Streaming Speech-to-Text model from Picovoice Console or download the default model.
Put the model file in the project's public directory, or generate a base64 model using the built-in pvbase64 script (run with npx).
Create a CheetahWorker instance using a base64 model or a model hosted in a public directory:
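The worker delivers results through a transcript callback supplied when the instance is created. The sketch below shows one way to write such a callback, assuming each result carries a `transcript` string and an optional `isEndpoint` flag. The `TranscriptChunk` interface and the `makeTranscriptCollector` helper are hypothetical names used for illustration and are not part of the SDK.

```typescript
// Assumed shape of one streaming result; it mirrors, but does not import, the SDK type.
interface TranscriptChunk {
  transcript: string;
  isEndpoint?: boolean;
}

// Hypothetical helper: collects partial transcripts into finished utterances.
function makeTranscriptCollector() {
  const utterances: string[] = [];
  let current = "";

  // Pass this function as the transcript callback when creating the worker.
  function onTranscript(chunk: TranscriptChunk): void {
    current += chunk.transcript;
    if (chunk.isEndpoint) {
      // An endpoint marks the end of an utterance, so store it and start over.
      const finished = current.trim();
      if (finished.length > 0) {
        utterances.push(finished);
      }
      current = "";
    }
  }

  return {
    onTranscript,
    utterances: (): string[] => [...utterances],
    pending: (): string => current,
  };
}
```

In a real project, `onTranscript` would be passed to `CheetahWorker.create` together with the AccessKey and a model descriptor, which is assumed here to be either `{ publicPath: ... }` for a hosted file or `{ base64: ... }` for a base64 model.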
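Subscribe the CheetahWorker instance to WebVoiceProcessor (WebVoiceProcessor.subscribe) to start processing audio frames.
Once done, unsubscribe it (WebVoiceProcessor.unsubscribe) to stop processing audio frames.
Release resources explicitly when done with Cheetah by calling release on the instance, then terminate to stop the worker.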
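The subscribe, unsubscribe, and release steps above can be sketched as a single helper that guarantees cleanup even when an error occurs. This is a minimal sketch, not the SDK's own API: `AudioEngine`, `AudioSource`, and `withListening` are hypothetical names, and the interfaces are structural stand-ins so the block does not depend on the installed packages.

```typescript
// Structural stand-ins for the SDK objects (assumptions, not imported types).
interface AudioEngine {
  release(): Promise<void> | void;
  terminate?(): void;
}

interface AudioSource<E> {
  subscribe(engine: E): Promise<void>;
  unsubscribe(engine: E): Promise<void>;
}

// Hypothetical helper: runs `whileListening` between subscribe and unsubscribe,
// then always releases the engine, even if an earlier step throws.
async function withListening<E extends AudioEngine>(
  source: AudioSource<E>,
  engine: E,
  whileListening: () => Promise<void>,
): Promise<void> {
  try {
    await source.subscribe(engine);
    try {
      await whileListening();
    } finally {
      // Stop feeding audio frames before the engine is released.
      await source.unsubscribe(engine);
    }
  } finally {
    await engine.release();
    engine.terminate?.();
  }
}
```

With the real packages, `source` would presumably be `WebVoiceProcessor` and `engine` the `CheetahWorker` instance. The nested `try`/`finally` keeps the order fixed: unsubscribe first, then release.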
Non-English Languages
To use Cheetah with a language other than English, use the model file (.pv) for that language. Model files for all supported languages are available in the Cheetah GitHub repository.
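One way to keep language selection in a single place is a small lookup from language code to model file. The sketch below assumes the model is served from the public directory and passed as a `{ publicPath }` descriptor. The file names, the `models` folder, and the `modelForLanguage` helper are hypothetical and should be replaced with the paths of the model files actually downloaded.

```typescript
// Assumed shape of the model argument: a path served from the public directory.
interface PublicModel {
  publicPath: string;
}

// Hypothetical layout: one .pv file per language under a "models" folder.
const MODEL_FILES = new Map<string, string>([
  ["en", "models/cheetah_params.pv"],
  ["de", "models/cheetah_params_de.pv"],
  ["es", "models/cheetah_params_es.pv"],
]);

// Returns the model descriptor for a language code, or throws if none is configured.
function modelForLanguage(code: string): PublicModel {
  const path = MODEL_FILES.get(code.toLowerCase());
  if (path === undefined) {
    throw new Error(`No Cheetah model configured for "${code}"`);
  }
  return { publicPath: path };
}
```

Failing fast on an unknown code avoids creating a worker with a model that does not match the expected language.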
Demo
A web demo project for the Cheetah Streaming Speech-to-Text Web SDK is available in the Cheetah Streaming Speech-to-Text GitHub repository.
Setup
Clone the Cheetah Streaming Speech-to-Text repository from GitHub (https://github.com/Picovoice/cheetah).
Usage
- Install dependencies:
- Run the demo with the start script and a language code to start a local web server hosting the demo in the language of your choice (e.g. de -> German, es -> Spanish). To see a list of available languages, run start without a language code.
- Open http://localhost:5000 to view it in the browser.
Resources
Package
API
GitHub
- Cheetah Streaming Speech-to-Text Web SDK on GitHub
- Cheetah Streaming Speech-to-Text Web Demo on GitHub