Soprano-TTS - Text-to-Speech Benchmark

Prerequisites

Ubuntu 20.04 (x86_64)
Git
Python 3.10
Picovoice Console Account

Usage

Clone the repository:

git clone https://github.com/Picovoice/text-to-speech-benchmark.git

Install the dependencies:

pip3 install -r requirements/requirements-soprano-tts.txt

Download the picoLLM model

For each benchmark a picoLLM model is required to generate responses from the LLM. The picoLLM model used in the benchmark is llama-3.2-1b-instruct-385 and can be downloaded from Picovoice Console.

Run the benchmark:

Hugging face model download commit hash:

Repo: ekwek/Soprano-1.1-80M.
Commit hash: 27b5a5f5f541a1db3a51d6fd1b0fc7147b92cd01.

For core hour ratio & latency metrics:

python3 benchmark.py \
--picovoice-access-key ${PV_ACCESS_KEY} \
--picollm-model-path ${PICOLLM_MODEL_PATH} \
--engine soprano_tts \

For peak memory metric:

for i in 1 2 3 4 5 6 7 8 9 10 20 40 80; do
    python3 benchmark.py \
    --picovoice-access-key ${PV_ACCESS_KEY} \
    --picollm-model-path ${PICOLLM_MODEL_PATH} \
    --engine soprano_tts \
    --test-memory-size-multiple $i \
done

Replace ${PICOLLM_MODEL_PATH} with the path to the model you downloaded. Replace ${PV_ACCESS_KEY} with your AccessKey obtained from Picovoice Console. Everyone who signs up for Picovoice Console receives a unique AccessKey.

Was this doc helpful?

Issue with this doc?