On-device transcription with cloud-level accuracy bringing control back to enterprises
Leopard Speech-to-Text is software that converts audio and video recordings into text with cloud-level accuracy without sacrificing privacy.
Leopard Speech-to-Text brings speech recognition to where data resides, enabling transcription on the device, mobile, web browsers, on-prem, or cloud.
o = pvleopard.create(access_key)transcript, words =o.process_file(path)Build with Python
const o = new Leopard(accessKey)const { transcript, words } =o.processFile(path)Build with NodeJS
Leopard o = new Leopard.Builder().setAccessKey(accessKey).setModelPath(modelPath).build(appContext);LeopardTranscript r =o.processFile(path);Build with Android
let o = Leopard(accessKey: accessKey,modelPath: modelPath)let r = o.processFile(path)Build with iOS
o = NewLeopard(accessKey)err := o.Init()transcript, words, err:= o.ProcessFile(path)Build with Go
Leopard o = new Leopard.Builder().setAccessKey(accessKey).build();LeopardTranscript r =o.processFile(path);Build with Java
Leopard o =Leopard.Create(accessKey);LeopardTranscript result =o.ProcessFile(path);Build with .NET
let o: Leopard =LeopardBuilder::new().access_key(access_key).init().expect("");if let Ok(result) =o.process_file(path) { }Build with Rust
Leopard o = await Leopard.create(accessKey,modelPath);LeopardTranscript result =await o.processFile(path);Build with Flutter
const o = await Leopard.create(accessKey,modelPath)const {transcript, words} =await o.processFile(path)Build with React Native
pv_leopard_t *leopard = NULL;pv_leopard_init(access_key,model_path,enable_automatic_punctuation,&leopard);char *transcript = NULL;int32_t num_words = 0;pv_word_t *words = NULL;pv_leopard_process_file(leopard,path,&transcript,&num_words,&words);Build with C
const leopard =await LeopardWorker.fromPublicDirectory(accessKey,modelPath);const {transcript,words} =await leopard.process(pcm);Build with Web
Speech-to-text APIs require enterprises to send their data to a 3rd party cloud, giving away control over their data and product.
Leopard Speech-to-Text offers the same performance with no compromises.
Creating new possibilities for your content, product, and database
Leopard Speech-to-Text offers cloud-level accuracy, model customization, and cross-platform support…
…without sacrificing privacy, reliability, and affordability, enabling use cases that were impossible before.
Evaluate the accuracy of Leopard Speech-to-Text vs other transcription APIs scientifically with the open-source speech-to-text benchmark, enabling you to make decisions confidently with your data.
Customize pre-trained speech-to-text models instantly by adding domain-specific vocabulary and boosting frequently-used words on a self-service platform, achieving the highest possible accuracy.
Deploy Leopard Speech-to-Text anywhere and offer seamless experiences across devices, mobile apps, web browsers, on-premise, cloud, or all.
Process voice data without sharing it with 3rd parties, ensuring compliance with GDPR, HIPAA, CCPA, and more - including any policies that come in the future.
Build reliable products with predictable response times by bringing speech-to-text closer to your data to bypass network latency, congestion, outages, or throttling.
Do not bear the cost of running bulky models in the cloud. Big Tech uses on-device speech-to-text for their products because running large models in the cloud is costly, even for them.
The best way to learn about Leopard Speech-to-Text is to use it!
Start NowSpeech-to-text (STT), also known as Automatic Speech Recognition (ASR) and Open-Domain Large Vocabulary Speech Recognition (LVSR), refers to the technology and methodologies that convert voice data into text.
Cloud-based speech-to-text APIs send voice data to vendors’ servers, where the transcription engine resides. On-device voice processing brings voice recognition where voice data is, eliminating all the steps related to cloud processing.