Orca Streaming Text-to-Speech Engine

Orca is an on-device streaming Text-to-Speech engine. Orca is:

Private; All voice processing runs locally.
Compact and Computationally-Efficient
Optimized for LLMs, enabling low-latency voice assistants.
Cross-Platform:
- Linux (x86_64)
- macOS (x86_64, arm64)
- Windows (x86_64, arm64)
- Android
- iOS
- Web
- Raspberry Pi (3, 4, 5)

EnglishEnglish

FrenchFrançais

GermanDeutsch

ItalianItaliano

Japanese日本語

Korean한국어

PortuguesePortuguês

SpanishEspañol

Get Started

Anyone who is using Picovoice needs to have a valid AccessKey. AccessKey is your authentication and authorization token for using Picovoice. It also verifies that your usage is within the limits of your account. You must keep your AccessKey secret!

Retrieve AccessKey

Download SDK

Picovoice SDKs are available both on GitHub and via SDK-specific package managers. Follow one of the quick starts to synthesize audio offline using Orca with your newly-created AccessKey.

Voices

Orca Streaming Text-to-Speech can synthesize speech with various voices, each of which is characterized by a model file. Currently, Orca Streaming Text-to-Speech offers default 'female' and 'male' voice models. The model files can be downloaded from the Orca GitHub repository. To synthesize speech with a specific voice, provide the associated model file as an argument to the orca init functions.

Custom Voices

For custom voices in any language, Enterprise Plan customers can engage with Picovoice Consulting.

Speech control

Orca Streaming Text-to-Speech provides a set of parameters to control the synthesized speech. While the default speech rate is 1, values between 0.7 and 1.3 can be used to generate faster or slower speech.

Custom Pronunciation

Orca Streaming Text-to-Speech supports custom pronunciation which can be expressed in ARPAbet format for English and different subsets of IPA for our other languages.

English

Vowels

ARPAbet

Example

balm

cat

sun

dog

bout

red

her

they

sit

see

boat

toy

book

food

Consonants

ARPAbet

Example

book

chair

dog

this

fan

hat

jump

kite

lamp

map

net

ring

pen

run

sun

ship

top

thin

van

wet

yes

zoo

measure

French

Vowels

IPA

Example

patte

ã

sans

bète

ɛ̃

vin

vie

rose

õ

non

sort

lune

lut

peur

sœur

œ̃

parfum

fief

oui

nuit

Consonants

IPA

Example

bon

deux

faire

longue

carte

lire

main

nom

mignon

camping

pain

rue

soleil

chien

temps

ville

mais

jour

German

Vowels

IPA

Example

Papa

aɪ

Bein

aʊ

Haus

ã

Chance

geben

mit

tot

õ

Balkon

Mut

Psychologie

schön

ø̃

Parfum

Söhne

kommen

ɔi

flugzeug

bitte

Ende

ɛː

Bär

ɛ̃

Pointe

immer

mit

Buch

müssen

Consonants

IPA

Example

Buch

ich

Dach

Fisch

Garten

Haus

Katze

Lampe

l̩

bittl

Mutter

m̩

großem

Nase

n̩

bitten

klingt

Person

Pfeffer

Rose

Zahl

Hase

Schule

Tag

Cello

Vater

Platz

Dschungel

Genie

Wasser

Italian

Vowels

IPA

Example

casa

pera

bèllo

filo

sole

còsa

luce

piano

guaio

Consonants

IPA

Example

bene

dire

mezzo

dʣ

azzurro

dʒ

gelo

ddʒ

maggio

fiore

gatto

cosa

luce

famiglia

mano

nave

banca

bagno

ŋg

angolo

ŋk

ancora

pane

rosa

terra

sole

scena

tempo

razza

tʃ

cibo

tʧ

feccia

tts

tazza

vento

azzurro

Japanese

Vowels

IPA

Example

aru (ある)

eki (えき)

iru (いる)

oni (おに)

unagi (うなぎ)

Consonants

IPA

Example

basho (ばしょ)

babbān (バッバーン)

byōki (びょうき)

hito (ひと)

shita (した)

ɕɕ

hasshak (ハッシャク)

dōmo (どうも)

zazen (ざぜん)

ʣʣ

kizzu (キッズ)

beddo (ベッド)

De~yuetto (デュエット)

fuyajou (フヤジョウ)

gakkō (がっこう)

ɡj

kigyō (きぎょう)

hon (ほん)

kohheru (コッヘル)

yuzu (ゆず)

kuru (くる)

kyōkai (きょうかい)

gekkō (ゲッコウ)

kkj

tokkyo (トッキョ)

mikan (みかん)

myaku (みゃく)

nattō (なっとう)

niwa (にわ)

nihon (にほん)

pan (パン)

happyō (はっぴょう)

happī (ハッピー)

ppj

toppyoushi (トッピョウシ)

roku (ろく)

ɾj

ryōri (りょうり)

suru (する)

shissou (シッソウ)

taberu (たべる)

setta (セッタ)

tsunami (つなみ)

ʦʦ

tetsui (テッツイ)

warudakumi (ワルダクミ)

dugong (ジュゴン)

ʤʤ

jajjame (ジャッジャメ)

chizu (チズ)

ʧʧ

ottchan (オッチャン)

Korean

Vowels

IPA

Example

agi (아기)

eomeoni (어머니)

aegyo (애교)

enuli (에누리)

ibal (이발)

oli (오리)

oegug (외국)

usan (우산)

wau (와우)

weding (웨딩)

wigi (위기)

wɛ

wae (왜)

wʌ

wonang (워낭)

deudieo (드디어)

ɯi

uisa (의사)

jɛ

yaegi (예기)

jʌ

yeogi (여기)

jagu (야구)

yesul (예술)

yoli (요리)

yuzu (유자)

Consonants

IPA

Example

sibi (시비)

ppeokkugi (뻐꾸기)

gido (기도)

ttalgi (딸기)

gija (기자)

jigu (지구)

kkoma (꼬마)

hyogwa (효과)

gudu (구두)

kʰ

kadeu (카드)

lamyeon (라면)

megi (메기)

namu (나무)

jang-gi (장기)

jib (집)

pʰ

podo (포도)

goli (고리)

sagwa (사과)

ssaum (싸움)

badchim (받침)

tʰ

toyoil (토요일)

jeoul (저울)

ʨʨ

jjukkumi (쭈꾸미)

ʨʰ

chigwa (치과)

ihae (이해)

Portuguese

Vowels

IPA

Example

meto

ẽ

incenso

ĩ

assim

bola

õ

bom

luz

ũ

mundo

papa

ɐ̃

mãe

pór

médio

Consonants

IPA

Example

abacaxi

cada

dʒ

jaze

faca

água

aire

caça

alma

amor

banho

papel

para

carro

casa

ato

tʃ

chava

avó

quando

azul

canha

ache

filho

ajuda

Spanish

Vowels

IPA

Example

casa

leche

silla

sol

muro

rey

huevo

Consonants

IPA

Example

beso

obra

día

adeptar

frío

gato

agua

ayuda

tako

lago

mano

énfasis

noche

señor

banco

pan

rosa

cara

sol

cielo

taza

chico

México

calle

mismo

Emotional Control

Custom Orca Streaming Text-to-Speech models generate voices with emotions and styles, including joy, anger, whispering, and shouting. Enterprise Plan customers can engage with Picovoice Consulting for emotional control.

Was this doc helpful?

Issue with this doc?

Orca Streaming Text-to-Speech Engine

Get Started

Sign up for Picovoice Console

Retrieve AccessKey

Download SDK

Voices

Custom Voices

Speech control

Custom Pronunciation

English

Vowels

Consonants

French

Vowels

Consonants

German

Vowels

Consonants

Italian

Vowels

Consonants

Japanese

Vowels

Consonants

Korean

Vowels

Consonants

Portuguese

Vowels

Consonants

Spanish

Vowels

Consonants

Emotional Control