Kokoro-TTS - Text-to-Speech Benchmark
Prerequisites
- Ubuntu 20.04 (x86_64)
- Git
- Python 3.10
- Picovoice Console Account
Usage
- Clone the repository:
- Install the dependencies:
- Download the picoLLM model
Each benchmark requires a picoLLM model to generate the LLM responses.
The picoLLM model used in this benchmark is llama-3.2-1b-instruct-385, which can be
downloaded from Picovoice Console.
- Run the benchmark:
Hugging Face model download:
- Repo: hexgrad/Kokoro-82M
- Commit hash: f3ff3571791e39611d31c381e3a41a3af07b4987
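To reproduce results against the exact model revision listed above, the download can be pinned to that commit. The following is a minimal sketch, assuming the `huggingface_hub` package is installed; whether the benchmark scripts fetch the model this way is an assumption, and the helper names `pinned_snapshot_args` and `download_pinned_model` are hypothetical.

```python
REPO_ID = "hexgrad/Kokoro-82M"
REVISION = "f3ff3571791e39611d31c381e3a41a3af07b4987"


def pinned_snapshot_args(repo_id=REPO_ID, revision=REVISION):
    """Build keyword arguments that pin a Hub download to one full commit hash."""
    is_hex = all(c in "0123456789abcdef" for c in revision)
    if len(revision) != 40 or not is_hex:
        raise ValueError("revision must be a full 40-character commit hash")
    return {"repo_id": repo_id, "revision": revision}


def download_pinned_model():
    """Download the pinned snapshot and return its local directory (needs network)."""
    # Imported lazily so pinned_snapshot_args() works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**pinned_snapshot_args())
```

Calling `download_pinned_model()` returns the local cache directory of the snapshot. Pinning to a commit hash rather than a branch name keeps the benchmark input fixed even if the repository is updated later.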
For core hour ratio & latency metrics:
For peak memory metric:
Replace ${PICOLLM_MODEL_PATH} with the path to the model you downloaded.
Replace ${PV_ACCESS_KEY} with your AccessKey obtained from Picovoice Console.
Everyone who signs up for Picovoice Console receives a unique AccessKey.
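For readers who want to sanity-check the peak memory metric independently, here is a minimal sketch of one way to measure the peak resident memory of a child process on Linux. It is not the benchmark's own measurement code; the function name `peak_rss_mib_of` is hypothetical, and the command passed to it is whatever you would normally run.

```python
import resource
import subprocess
import sys


def peak_rss_mib_of(command):
    """Run a command to completion and return the peak RSS of child processes in MiB."""
    subprocess.run(command, check=True)
    # On Linux, ru_maxrss is reported in KiB and covers the largest
    # waited-for child of this process, not only the last one started.
    peak_kib = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    return peak_kib / 1024.0


if __name__ == "__main__":
    # Placeholder command; substitute the benchmark invocation you want to measure.
    print(peak_rss_mib_of([sys.executable, "-c", "pass"]))
```

Because the value reflects the largest child seen so far, run each measurement in a fresh Python process to avoid carrying over a peak from an earlier command.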