Neu-TTS-Nano-Q4-GGUF - Text-to-Speech Benchmark
Prerequisites
- Ubuntu 20.04 (x86_64)
- Git
- Python 3.10
- Picovoice Console Account
Usage
- Clone the repository:
- Install the dependencies:
- Download the picoLLM model
For each benchmark a picoLLM model is required to generate responses from the LLM.
The picoLLM model used in the benchmark is llama-3.2-1b-instruct-385 and can be
downloaded from Picovoice Console.
- Run the benchmark:
Hugging face model download commit hash:
- Repo:
neuphonic/neutts-nano-q4-gguf.- Commit hash:
8ae1694877fdf9d7c4a7bee2cc9775ba7eab3923.
- Commit hash:
- Repo:
neuphonic/neucodec-onnx-decoder.- Commit hash:
55b95ccfb0b0a63bd033f0f78e6366607a616a33.
- Commit hash:
For core hour ratio & latency metrics:
For peak memory metric:
Replace ${REF_TEXT_PATH} with the path to the reference text for voice cloning. E.g. --neutts-ref-text-path ~/neutts/samples/jo.txt. Replace ${REF_CODES_PATH} with the path to the reference codes for voice cloning. E.g. --neutts-ref-codes-path ~/neutts/samples/jo.pt.
Replace ${PICOLLM_MODEL_PATH} with the path to the model you downloaded.
Replace ${PV_ACCESS_KEY} with your AccessKey obtained from Picovoice Console.
Everyone who signs up for Picovoice Console receives a unique AccessKey.