Cheetah Speech-to-Text
Android Quick Start

Platforms

Android (5.0+)

Requirements

Picovoice Account and AccessKey
Android Studio
Android device with USB debugging enabled or Android simulator

Picovoice Account & AccessKey

Signup or Login to Picovoice Console to get your AccessKey. Make sure to keep your AccessKey secret.

Quick Start

Setup

Install Android Studio.
Include mavenCentral() repository in the top-level build.gradle. Then add the following to the app's build.gradle:

dependencies {
    // ...
    implementation 'ai.picovoice:cheetah-android:${LATEST_VERSION}' // replace with latest version
}

Add the following to the app's AndroidManifest.xml file to enable recording with an Android device's microphone:

<uses-permission android:name="android.permission.INTERNET" />
<uses-permission android:name="android.permission.RECORD_AUDIO" />

Model File

Add the Cheetah Streaming Speech-to-Text model file to your Android application:

Create a custom model using the Picovoice Console or use the default model.
Add the model as a bundled resource by placing it under the ${ANDROID_APP}/src/main/assets directory of your Android project.

Usage

Create an instance of the engine with the Cheetah Streaming Speech-to-Text Builder by passing in your AccessKey, model file and the Android app context:

import ai.picovoice.cheetah.*;

final String accessKey = "${ACCESS_KEY}"; // AccessKey provided by Picovoice Console (https://console.picovoice.ai/)
final String modelPath = "${MODEL_PATH}"; // path relative to the assets folder or absolute path to file on device

try {
    Cheetah cheetah = new Cheetah.Builder()
      .setAccessKey(accessKey)
      .setModelPath(modelPath)
      .build(appContext);
} catch (CheetahException ex) { }

Transcribe real-time audio:

short[] getNextAudioFrame() {
    // .. get audioFrame
    return audioFrame;
}

String transcript = "";

while (true) {
    try {
        short[] audioFrame = getNextAudioFrame();
        CheetahTranscript partialResult = cheetah.process(audioFrame);
        transcript += partialResult.getTranscript();

        if (partialResult.getIsEndpoint()) {
            CheetahTranscript finalResult = cheetah.flush();
            transcript += finalResult.getTranscript();
        }
    } catch (CheetahException ex) { }
}

When done, release resources explicitly:

cheetah.delete();

Non-English Languages

In order to use Cheetah with other languages, you need to use the corresponding model file (.pv) for the desired language. The model files for all supported languages are available on the Cheetah GitHub repository.

Demo

For the Cheetah Streaming Speech-to-Text Android SDK, we offer demo applications that demonstrate how to use the Speech-to-Text engine on real-time audio streams (i.e. microphone input).

Setup

Clone the Cheetah Streaming Speech-to-Text repository from GitHub using HTTPS:

git clone --recurse-submodules https://github.com/Picovoice/cheetah.git

Usage

Open the Android demo using Android Studio.
Copy your AccessKey from Picovoice Console into the ACCESS_KEY variable in MainActivity.java.
Run the application using a connected Android device or using an Android simulator.

Resources

Package

cheetah-android on Maven Central

API

cheetah-android API Docs

GitHub

Benchmark

Real-time Transcription Benchmark

Was this doc helpful?

Issue with this doc?

Cheetah Speech-to-Text Android Quick Start

Platforms

Requirements

Picovoice Account & AccessKey

Quick Start

Setup

Model File

Usage

Non-English Languages

Demo

Setup

Usage

Resources

Package

API

GitHub

Benchmark

Cheetah Speech-to-Text
Android Quick Start