Picovoice Shepherd - i.MX RT1050
The i.MX RT1050 Evaluation Kit is a development board featuring an i.MX RT1050 crossover MCU based on Arm Cortex®-M7. It has 512KB of on-chip RAM. The Cortex-M7 core operates at up to 600 MHz.
Picovoice Shepherd is the first no-code platform for building voice interfaces on microcontrollers. It enables creating voice experiences similar to Alexa that run entirely on microcontrollers. Picovoice Shepherd accelerates prototyping, mitigates technical risks, and shortens time to market. Paired with Picovoice Console users can deploy custom voice models into microcontrollers instantly.
Platforms
- Linux (x86_64)
- macOS (x86_64, arm64)
- Windows (x86_64)
Requirements
- Picovoice Account and AccessKey
- Python 3
- PIP
- tkinter
- i.MX RT1050 Evaluation Kit
Picovoice Account & AccessKey
Signup or Login to Picovoice Console to get your AccessKey
.
Make sure to keep your AccessKey
secret.
Setup
Install Python 3.
Install the pvshepherd Python package using PIP:
- Install tkinter:
- For Debian based Linux distribution:
- For Linux distribution with yum installer:
- For macOS:
- For Windows:
tkinter
is installed by default.
Usage
Run the following command from the terminal:
Upload the Picovoice Firmware
First, connect the board to the computer using a MICRO-USB Type-B cable. i.MX RT1050 Evaluation has three USB ports on it; Make sure that the USB port on the top right side of the board is used as shown in the figure below:
Select iMXRT1050-EVKB
and the language of your choice on the first page.
Press the Upload Firmware
button and wait for completion.
Upload The Default Models
Get an AccessKey and paste it to the AccessKey field.
The unique universal identifier (UUID) of Microcontroller on the board is at the top. You need this UUID to create custom models using Picovoice Console. It is also displayed on the board's LCD. For now, let's continue with the default models. Upload the default voice models to the board by pressing Use Default Models
.
Test the Default Models
The i.MX RT1050 evaluation board is ready to test. It has started processing the audio input from the microphone in real-time. It writes to the Shepherd console when the Picovoice engine detects utterances of the given wake word and follow-on voice commands. Say:
Picovoice, set the living room lights purple
Picovoice will detect the occurrence of the custom wake word (Picovoice), and then extracts the intent from the follow-on spoken command:
The Show Context
button opens a new window and lists all the available voice commands.
The volume and CPU usage are on the top left. The inference sensitivity of the engines can be changed on the fly. The sensitivity parameter controls the tradeoff between the miss rate and false alarm. A higher sensitivity reduces the miss rate (false reject rate) at the cost of increased false alarm rate.
Audio Debugging
You can record and save the audio fed to the Picovoice from Shepherd. Go to the Audio debugging
tab and click on the Record Audio
button.
Create Custom Models
- Go back to the
Upload Model
page and copy the UUID to the clipboard using theCopy
button. - Go to Picovoice Console to create models for Porcupine wake word engine and Rhino Speech-to-Intent engine.
- Select
Arm Cortex-M
as the platform when training the model. - Select
IMXRT
as the board type and provide the UUID of the chipset on the board.
Upload the Custom Models
- Download your custom voice model(s) from Picovoice Console.
- Decompress the zip file. The model file is either
.ppn
for Porcupine wake word or.rhn
for Rhino Speech-to-Intent. - Go to the
Upload Model
page and select the models. - Press the
Upload
button.
Demo Project
Additionally, there is a demo project on the Picovoice GitHub repository for the i.MX RT1050 Evaluation Kit to ease integrating the designed voice interface into projects.