Cheetah Speech-to-Text
Web Quick Start
Platforms
- Chrome & Chromium-based browsers
- Edge
- Firefox
- Safari
Requirements
- Picovoice Account and AccessKey
- Node.js 16+
- npm
Picovoice Account & AccessKey
Sign up or log in to Picovoice Console to get your AccessKey. Make sure to keep your AccessKey secret.
Quick Start
Setup
Install Node.js.
Install the Web Voice Processor and Cheetah Streaming Speech-to-Text Web packages with npm:
npm install @picovoice/web-voice-processor @picovoice/cheetah-web
Usage
Generate a custom Cheetah Streaming Speech-to-Text model from Picovoice Console or download the default model.
Put the model file in the project's public directory, or generate a base64 model using the built-in pvbase64 script, run through npx.
Create a CheetahWorker instance using a base64 model or a model hosted in a public directory.
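The choice between the two model sources can be sketched as a small helper. This is an illustration only: the ModelSource type and the toCheetahModel function are hypothetical names, and the base64 and publicPath field names, along with the CheetahWorker.create call shown in the trailing comment, are assumptions that should be checked against the SDK's API reference.

```typescript
// Model argument shape used in this sketch. The field names `base64` and
// `publicPath` are assumptions; check them against the SDK's API reference.
type CheetahModel = { base64: string } | { publicPath: string };

// Hypothetical description of where the model comes from.
type ModelSource =
  | { kind: "base64"; data: string }
  | { kind: "hosted"; path: string };

// Convert a model source into the object passed when creating the worker.
function toCheetahModel(source: ModelSource): CheetahModel {
  if (source.kind === "base64") {
    if (source.data.length === 0) {
      throw new Error("base64 model string is empty");
    }
    return { base64: source.data };
  }
  if (!source.path.endsWith(".pv")) {
    throw new Error(`expected a .pv model file, got "${source.path}"`);
  }
  return { publicPath: source.path };
}

// Assumed wiring with the real SDK (not executed in this sketch):
//   import { CheetahWorker } from "@picovoice/cheetah-web";
//   const cheetah = await CheetahWorker.create(
//     accessKey,                                   // AccessKey from Picovoice Console
//     (result) => console.log(result.transcript),  // called with each transcript
//     toCheetahModel({ kind: "hosted", path: "/cheetah_params.pv" })
//   );
```

A hosted model keeps the JavaScript bundle small, while a base64 model avoids a separate network request for the model file.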
Subscribe the CheetahWorker instance to WebVoiceProcessor to start processing audio frames.
Once done, unsubscribe it to stop processing audio frames.
Release resources explicitly when done with Cheetah.
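The subscribe, unsubscribe, and release steps can be sketched together so that cleanup happens even when something fails in between. The FrameSource and ReleasableEngine interfaces and the runWithEngine helper are hypothetical stand-ins written for this example, not the SDK's own typings. The wiring in the trailing comment assumes that WebVoiceProcessor exposes subscribe and unsubscribe and that the worker exposes release.

```typescript
// Minimal structural types standing in for the SDK objects.
// These are assumptions for illustration, not the SDK's own typings.
interface ReleasableEngine {
  release(): void | Promise<void>;
}

interface FrameSource<E> {
  subscribe(engine: E): Promise<void>;
  unsubscribe(engine: E): Promise<void>;
}

// Subscribe the engine, run the session, then always unsubscribe and release.
async function runWithEngine<E extends ReleasableEngine>(
  source: FrameSource<E>,
  engine: E,
  session: () => Promise<void>
): Promise<void> {
  try {
    await source.subscribe(engine); // audio frames start flowing to the engine
    try {
      await session(); // e.g. wait until the user presses "stop"
    } finally {
      await source.unsubscribe(engine); // stop processing audio frames
    }
  } finally {
    await engine.release(); // free engine resources even if subscribing failed
  }
}

// Assumed wiring with the real SDK (not executed in this sketch):
//   import { WebVoiceProcessor } from "@picovoice/web-voice-processor";
//   await runWithEngine(WebVoiceProcessor, cheetah, waitForStopButton);
```

Nesting the two try/finally blocks means the engine is unsubscribed only if subscribing succeeded, while release runs in every case.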
Non-English Languages
To use Cheetah with another language, use the model file (.pv) for that language. The model files for all supported languages are available on the Cheetah GitHub repository.
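One way to select the model file per language is a lookup table keyed by language code. In this sketch the file names, the /models directory, and the modelPathFor helper are all hypothetical; use the actual .pv file names from the repository and the path where your project serves them.

```typescript
// Hypothetical mapping from language code to model file name.
// The file names are assumptions; use the names of the .pv files you downloaded.
const MODEL_FILES = new Map<string, string>([
  ["en", "cheetah_params.pv"],
  ["de", "cheetah_params_de.pv"],
  ["es", "cheetah_params_es.pv"],
]);

// Build the public path of the model for a language code such as "de".
function modelPathFor(language: string, publicDir: string = "/models"): string {
  const file = MODEL_FILES.get(language.trim().toLowerCase());
  if (file === undefined) {
    const known = Array.from(MODEL_FILES.keys()).join(", ");
    throw new Error(`no model configured for "${language}" (known: ${known})`);
  }
  return `${publicDir}/${file}`;
}
```

The returned path can then be supplied as the hosted model location when creating the CheetahWorker instance.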
Demo
A web demo project for the Cheetah Streaming Speech-to-Text Web SDK is available on the Cheetah Streaming Speech-to-Text GitHub repository.
Setup
Clone the Cheetah Streaming Speech-to-Text repository from GitHub:
git clone --recurse-submodules https://github.com/Picovoice/cheetah.git
Usage
- From the web demo directory of the cloned repository, install dependencies (for example, with npm install).
- Run the demo with the start script and a language code to start a local web server hosting the demo in the language of your choice (e.g. de -> German, es -> Spanish). To see a list of available languages, run start without a language code.
- Open http://localhost:5000 to view it in the browser.
Resources
Package
API
GitHub
- Cheetah Streaming Speech-to-Text Web SDK on GitHub
- Cheetah Streaming Speech-to-Text Web Demo on GitHub