Google STT - Speaker Diarization Benchmark
Prerequisites
- Ubuntu 20.04 (x86_64)
- Git
- Python 3.7+
- PIP
- Google Cloud Account
Usage
- Clone the repository:
- Install the dependencies:
Set up the dataset as described in the main readme of the repository.
Create a Google Cloud storage bucket on you Google Cloud account.
Run the benchmark:
Where:
typeis the type of benchmark to run. It can beACCURACY,CPU, orMEMORY.datasetis the name of the dataset to use.data-folderis the path to the folder containing the audio files.label-folderis the path to the folder containing the label files.gcp-credentialsis the path to the Google Cloud credentials file.gcp-bucket-nameis the name of the Google Cloud storage bucket.gcp-regionis the region of the Google Cloud storage bucket.engineis the name of the engine to benchmark. It can beGOOGLE_SPEECH_TO_TEXTorGOOGLE_SPEECH_TO_TEXT_ENHANCEDdepending on the type of benchmark to run.