In the age of Netflix, Tiktok, Youtube, Twitch, Zoom and podcasts, we have more devices, mobile, and web applications generating data. UpKeep predicts that 175 ZB of new data will be generated by 2025. More than 80% of it will be unstructured, including audio and video. The need for making audio and video content searchable, just like Google Search Engine did for websites, is growing. Enabling keyword search for audio for monitoring, compliance, and further analysis helps enterprises reduce their risks. However, given the scale of data, it’s not easy nor affordable for everyone. The standard approach to Voice Search has two steps. First, convert voice to text via a speech-to-text engine, then perform a text-based keyword search. However, speech-to-text engines struggle with the proper names like brands, people, industry-specific jargon, and homophones. Some speech-to-text solutions, like Leopard Speech-to-Text, allow customization to a certain degree. Even when tuning is not required, transcribing voice in the cloud has inherent costs. These costs can be a show-stopper even for large enterprises considering the millions of hours of voice data. Not anymore! On-device voice recognition enables enterprises to analyze audio and video files at a small fraction of cloud API costs.
Moving to Picovoice for Numina Group’s Victory Voice solution provided a robust speaker-independent voice recognizer. Picovoice is fast, accurate and supports multiple languages. Both the software tools and technical support services are top-notch. The team is great to work with, they are responsive and accommodating.
Open-source, open-data NLU benchmark results show Picovoice Rhino outperforms alternative NLU engines, such as Amazon Lex, Google Dialogflow, IBM Watson and Microsoft Luis. Rhino Speech-to-Intent fuses speech-to-text (STT) and natural language understanding (NLU) to give all you need to add voice commands.
Picovoice speech-to-text and voice search engines are at least 10x more affordable than Amazon, Google, IBM and Microsoft. Enterprises avoid surprising cloud bills with simple and predictable pricing.
Every business and use case has different requirements. Customize Picovoice engines or use them together to find spoken keywords in audio files or real-time conversations.