Phonetic-based keyword search engine for audio streams, enabling search in massive libraries in seconds
Octopus Speech-to-Index is a search engine that indexes speech directly without converting it into text, enabling a keyword search within audio and video files.
Octopus Speech-to-Index finds any keyword, including proper names or slang, without knowing the exact spelling, removing the limitations of automated transcription solutions.
o = pvoctopus.create(access_key)metadata =o.index_audio_file(path)matches = o.search(metadata,phrases)
Enterprises use automated transcription to find keywords and phrases in the audio and video libraries, despite not being built for this purpose. Automated transcription struggles with homophones and cannot transcribe words if not in the dictionary.
Octopus Speech-to-Index uses an acoustic-based search, achieving much higher accuracy than a generic transcription engine.
Monetize your content, monitor conversations, or ensure compliance without the limitations of automatic transcription.
Reduce errors by four times compared to Google Speech-to-Text. The open-source benchmark shows Octopus Speech-to-Index is the right tool for the job and outperforms the workarounds.
Make audio and video files searchable in seconds. Compared to Mozilla DeepSpeech, Octopus Speech-to-Index processes voice data 50 times faster while returning ten times more accurate results.
Protect sensitive information, such as call center recordings with personal data or legal depositions with confidential information. Automated transcription APIs send voice data to a 3rd party cloud to process it, while Octopus Speech-to-Index processes voice data anywhere.
The best way to learn about Octopus Speech-to-Index is to use it!
Start NowSpeech-to-Index, also known as audio indexing, speech indexing, and acoustic indexing, is a technique that makes audio automatically searchable and discoverable. As it performs searches based on phonetics, it’s also known as phonetic search, phonetic-based search, and acoustic search. It allows quick searches and rapid access to audio content. Picovoice built Octopus Speech-to-Index as a response to market demand. It indexes even massive audio and media libraries as Google indexes websites and returns keyword search results.
Octopus Speech-to-Index is built for finding keywords and phrases, whereas speech-to-text is for generic transcription. Given the maturity of text indexing algorithms, transcribing voice to text and then performing a search based on text seem like a good workaround to many. However, speech-to-text has limitations in correctly identifying these proper nouns and homophones. (Katia Leighton vs. Katja Layton and fair vs. fare). Acknowledging speech-to-text limitations, Picovoice built an acoustic-based phonetic search engine, Octopus Speech-to-Index, dedicated to finding keywords and phrases in audio libraries with high accuracy and speed.