Octopus Speech-to-Index

Find spoken words within audio files

Scalable, efficient and production-ready acoustic indexing

Start Building for Free
Press the button
to start searching with Octopus

Accurate — Backed by open benchmark, not pre-recorded videos

When it comes to accuracy, make a decision based on data, not prerecorded demos. We have created an open-source accuracy benchmark to show that Octopus Speech-to-Index achieves accuracies much higher than Google Speech-to-Text.

Open-source benchmark result shows Octopus outperforms Google Speech-to-Text & Mozilla DeepSpeech when used for Voice Search.
Press the button
to start searching with Octopus

Private — intrinsically compliant with GDPR, HIPAA and more!

Process voice data on your premise without sending it to a third-party cloud. Do not risk sensitive information such as call centre recordings with personal information or meeting transcriptions with trade secrets. Once the demo is activated, turn off your internet connection, the demo will still work!

Build with Picovoice SDKs in Minutes

Start indexing your voice data with the production-ready Octopus Speech-to-Index and the SDK of your choice. Add only a few lines of code and make your voice data searchable.

octopus = pvoctopus.create(access_key)
metadata = octopus.index_audio_file(path)
matches = octopus.search(
Build with Python
Octopus octopus = new Octopus.Builder()
OctopusMetadata metadata = octopus.indexAudioFile(path);
HashMap <String, OctopusMatch[]> matches = octopus.search(
Build with Android
let octopus = Octopus(accessKey: accessKey)
let metadata = octopus.indexAudioFile(path: path)
let matches = octopus.search(
metadata: metadata,
phrases: phrases)
Build with iOS
const octopus =
await OctopusWorkerFactory
command: "index",
input: audioSignal,
command: "search",
metadata: octopusMetadata,
searchPhrase: searchText,
Build with JavaScript
pv_octopus_t *octopus = NULL;
Build with C

Deploy On-Device, On-Premise, or any Cloud

Make all your voice data searchable without worrying about where they are. Run efficient and scalable Octopus Speech-to-Index anywhere.

Start Building
Start Building













Język polski

Tiếng Việt



Use Cases

Search By Voice

Add voice for truly hands-free search experiences on the websites, mobile applications and devices.

Read More
Voice Search

Add voice search to mobile applications, websites, and devices. Find keywords and phrases in audio, video, and streams.

Read More
Speech Analytics

Transformative customer and employee experience with speech analytics and intelligence tools powered by the only end-to-end Voice AI platform.

Read More
Voice Command

Add voice commands to devices, mobile or web applications to elevate user experience.

Read More

Learn more about Octopus Speech-to-Index Engine

  • What is Speech-to-Index?

    Speech-to-Index is a technique that makes audio automatically searchable. Audio indexing, speech indexing, acoustic indexing, and acoustic searching are other terms used in the industry and academia. It allows quick searches and rapid access to audio content. Since Octopus Speech-to-Index directly indexes speech, we coined the term Speech-to-Index.

    Octopus was born due to market demand. Especially for media & entertainment use cases, customers were asking for something like Google but for audio and video content. Octopus was born to index speech data, like Google indexes websites.

  • Why is Octopus Speech-to-Index better than using Speech-to-Text for Voice Search?

    In short, Octopus, Speech-to-Index is built for Voice Search, and Speech-to-Text is a workaround to enable search by converting voice to text.

    Voice Search has been a topic of interest to make audio content searchable and discoverable. Given the maturity of text indexing algorithms, converting voice to text and performing a search based on the transcription is one of the first things that comes to mind. However, Speech-to-Text has several limitations: the out-of-vocabulary, longtail proper names, and competing hypotheses (homophones). We say this despite our Speech-to-Text offerings because we value transparency. Google also admits these limitations: “one of the most difficult challenges of automated speech-to-text is correctly identifying these proper nouns, known as named entities.” Google has been working on it for a while by trying to improve ASR. Despite the limitations, Google and other voice technology providers try to fit speech-to-text to every problem. We tried to understand the need first and built a product accordingly.

    Enterprises use voice search to find proper nouns: location, brand, product, company, and celebrity names. It is also used for archiving, offering media and entertainment content, and social media listening. The need is to find proper names. However, it is almost impossible to foresee each search when tuning speech-to-text. Re-transcribing already transcribed recordings with extra tuned speech-to-text is also costly. Acknowledging speech-to-text limitations, we built an alternative engine for the Voice Search use case.

  • What can I build with Octopus Speech-to-Index?

    Voice Search is for archiving, making media and entertainment content more discoverable, and social media listening. There are some applications in governance and compliance. Don’t forget to check out use case pages such as Voice Search and Speech Analytics to learn more.

  • Which platforms do Octopus Speech-to-Index support?

    1. Mobile Applications: Android and iOS
    2. Web Browsers : Chrome, Safari, Firefox, and Edge
    3. Desktop and Servers: Linux, macOS, and Windows
  • Which languages does Octopus Speech-to-Index support?

    Octopus Speech-to-Index offers multilingual support for English, French, German, Italian, Japanese, Korean, Portuguese, Spanish, and more.

  • What should I do if I need support for other languages?

    Contact Picovoice Sales and tell us about the opportunity, including the use case, requirements and project details.

  • How do I get technical support for Octopus Speech to Intent?

    Picovoice docs, blogs, Medium posts, and GitHub are great resources to learn about voice recognition, Picovoice engines, and how to start building voice search products. Picovoice also offers GitHub community support to all Free Tier users.

  • How can I get informed about the updates and upgrades?

    Version changes appear in the Picovoice Newsletter, LinkedIn, and Twitter. Subscribing to GitHub is the best way to get notified of the patch releases. If you enjoy building with Octopus, don’t forget to give it a star when you’re on GitHub!

Lead the Voice Revolution!

Build private, fast, cost-effective, cross-platform voice products
Talk to a Voice AI Expert