mcuPicovoice Shepherd - i.MX RT1050

  • Edge Voice Interface
  • End-to-End Voice Platform
  • Offline Voice Recognition
  • Local Speech Recognition
  • Speech-to-Intent
  • Domain-Specific NLU
  • Wake Word Detection
  • i.MX RT
  • iMXRT1050-EVKB

The i.MX RT1050 Evaluation Kit is a development board featuring an i.MX RT1050 crossover MCU based on Arm Cortex®-M7. It has 512KB of on-chip RAM. The Cortex-M7 core operates at up to 600MHz.

imxrt1050 discovery board overview

Picovoice Shepherd is the first no-code platform for building voice interfaces on microcontrollers. It enables creating voice experiences similar to Alexa that run entirely on microcontrollers. Picovoice Shepherd accelerates prototyping, mitigates technical risks, and shortens time to market. Paired with Picovoice Console users can deploy custom voice models into microcontrollers instantly.

Requirements

  • Python 3

Compatibility

  • Linux (x86_64)
  • macOS (x86_64)
  • Windows (x86_64)

Installation

Install the Picovoice Shepherd:

pip3 install pvshepherd

Note for macOS

Install Python using either the official installer or Homebrew. Shepherd cannot run using the Python shipped with macOS. If using the Homebrew Python, make sure that /usr/local/bin is in the PATH variable before installing Shepherd.

Note for Windows

The default Python installation options do not add it to the Windows PATH variable. To fix the issue, refer to this link.

Usage

Run the following command from the terminal:

pvshepherd

Upload the Picovoice Firmware

First, connect the board to the computer using a MICRO-USB Type-B cable. i.MX RT1050 Evaluation has three USB ports on it; Make sure that the USB port on the top right side of the board is used as shown in the figure below:

imxrt1050 from top

Select iMXRT1050-EVKB on the first page.

Selecting your board and press the upload button at the bottom right corner of the window

Press the Upload Firmware button and wait for completion.

Upload The Default Models

The unique universal identifier (UUID) of Microcontroller on the board is at the top. You need this UUID to create custom models using Picovoice Console. It is also displayed on the board's LCD. For now, let's continue with the default models. Upload the default voice models to the board by pressing Use Default Models.

Selecting your models and press the upload button at the bottom right corner of the window

Test the Default Models

The i.MX RT1050 evaluation board is ready to test. It has started processing the audio input from the microphone in real-time. It writes to the Shepherd console when the Picovoice engine detects utterances of the given wake word and follow-on voice commands. Say:

Picovoice, make the living room color purple

Picovoice will detect the occurrence of the custom wake word (Picovoice), and then extracts the intent from the follow-on spoken command:

{
is_understood : True,
intent : changeColor,
slots {
location : living room,
color : purple,
}
}

The Show Context button opens a new window and lists all the available voice commands.

Monitoring the board activities and changing its parameters

The volume and CPU usage are on the top left. The inference sensitivity of the engines can be changed on the fly. The sensitivity parameter controls the tradeoff between the miss rate and false alarm. A higher sensitivity reduces the miss rate (false reject rate) at the cost of increased false alarm rate.

Audio Debugging

You can record and save the audio fed to the Picovoice from Shepherd. Go to the Audio debugging tab and click on the Record Audio button.

Saving audio recorded by the board for debugging

Create Custom Models

  1. Go back to the Upload Model page and copy the UUID to the clipboard using the Copy button.
  2. Go to Picovoice Console to create models for Porcupine wake word engine and Rhino Speech-to-Intent engine.
  3. Select Arm Cortex-M as the platform when training the model.
  4. Select IMXRT as the board type and provide the UUID of the chipset on the board.
create custom wake-word for the board

Upload the Custom Models

  1. Download your custom voice model(s) from Picovoice Console.
  2. Decompress the zip file. The model file is either .ppn for Porcupine wake word or .rhn for Rhino Speech-to-Intent.
  3. Go to the Upload Model page and select the models.
  4. Press the Upload button.
upload the custom models to the board

Issue with this doc? Please let us know.