Soprano-TTS - Text-to-Speech Benchmark
Prerequisites
- Ubuntu 20.04 (x86_64)
- Git
- Python 3.10
- Picovoice Console Account
Usage
- Clone the repository:
- Install the dependencies:
- Download the picoLLM model
For each benchmark a picoLLM model is required to generate responses from the LLM.
The picoLLM model used in the benchmark is llama-3.2-1b-instruct-385 and can be
downloaded from Picovoice Console.
- Run the benchmark:
Hugging face model download commit hash:
- Repo:
ekwek/Soprano-1.1-80M. - Commit hash:
27b5a5f5f541a1db3a51d6fd1b0fc7147b92cd01.
For core hour ratio & latency metrics:
For peak memory metric:
Replace ${PICOLLM_MODEL_PATH} with the path to the model you downloaded.
Replace ${PV_ACCESS_KEY} with your AccessKey obtained from Picovoice Console.
Everyone who signs up for Picovoice Console receives a unique AccessKey.