Speaker Recognition for Web using JavaScript

🚀 Best-in-class Voice AI!

Build compliant and low-latency AI apps running within web browsers without sending user data to 3rd party servers.

Speaker Recognition is the process of analyzing unique voice characteristics to identify and verify individuals, enabling applications such as voice authentication, speaker-based personalization, and speaker spotting.

A major challenge for many Speaker Recognition applications is the high latency of cloud-based services, which leads to poor user experience. Fortunately, Picovoice's Eagle Speaker Recognition provides on-device Speaker Recognition, effectively bypassing the issues posed by cloud usage without sacrificing accuracy.

In just a few lines of code, you can start performing Speaker Recognition with a microphone using the Eagle Speaker Recognition Web SDK. Let’s get started!

Install Eagle Speaker Recognition Web SDK

Install the Eagle Speaker Recognition Web SDK using npm:

npm install @picovoice/eagle-web

Next, create a Picovoice Console account, and copy your AccessKey from the main dashboard. Creating an account is free, and no credit card is required!

Usage

Eagle Speaker Recognition Model

Add the Eagle Speaker Recognition model to the project by:

Either copying the model file to the project's public directory:

cp ${EAGLE_PARAMS_PATH} ${PUBLIC_DIRECTORY}/${EAGLE_PARAMS}

(or)

Create a base64 string of the model using the pvbase64 script included in the package:

npx pvbase64 -i ${EAGLE_PARAMS_PATH} -o ${OUTPUT_DIRECTORY}/${MODEL_NAME}.js

Create an object containing the Eagle model options:

import base64model from '${OUTPUT_DIRECTORY}/${MODEL_NAME}.js'

const eagleModel = {
  publicPath: '${PUBLIC_DIRECTORY}/${EAGLE_PARAMS}',
  // or
  base64: base64model,
}

Speaker Enrollment

Initialize an Eagle Profiler with the eagleModel variable containing the model options:

import { EagleProfilerWorker } from "@picovoice/eagle-web";

const eagleProfiler = await EagleProfilerWorker.create(
    "${ACCESS_KEY}",
    eagleModel
);

EagleWorker uses web workers to process audio data. Web workers might not be supported (i.e. Firefox private mode). In this case, use Eagle instead, which uses the main thread to process audio data.

Enroll speakers

The .enroll() function takes in frames of audio and provides feedback on the audio quality and Enrollment percentage. Use the percentage value to determine when Enrollment is completed and another speaker can be enrolled:

import {
  EagleProfilerEnrollResult,
  EagleProfilerEnrollFeedback,
} from "@picovoice/eagle-web";

function getAudioData(numSamples): Int16Array {
  // get audio frame of size `numSamples`
}

let percentage = 0;
while (percentage < 100) {
  const audioData = getAudioData(eagleProfiler.minEnrollSamples);
  const result: EagleProfilerEnrollResult = await eagleProfiler.enroll(audioData);  
  switch (result.feedback) {
    case EagleProfilerEnrollFeedback.AUDIO_OK:
    case EagleProfilerEnrollFeedback.AUDIO_TOO_SHORT:
    case EagleProfilerEnrollFeedback.UNKNOWN_SPEAKER:
    case EagleProfilerEnrollFeedback.NO_VOICE_FOUND:
    case EagleProfilerEnrollFeedback.QUALITY_ISSUE:
  }
  
  percentage = result.percentage;
}

Once Enrollment reaches 100%, export the speaker profile to use in the next step, Speaker Recognition:

const speakerProfile = eagleProfiler.export()

Profiles can be made for additional users by calling the .reset() function on the EagleProfiler, and repeating the .enroll() step.

Once profiles have been created for all speakers, make sure to clean up used resources:

eagleProfiler.release();