Picovoice Wordmark
Start Building
Introduction
Introduction
AndroidC.NETiOSLinuxmacOSNode.jsPythonRaspberry PiWebWindows
AndroidC.NETiOSNode.jsPythonWeb
SummaryPicovoice picoLLMGPTQ
Introduction
AndroidC.NETFlutteriOSJavaLinuxmacOSNode.jsPythonRaspberry PiReactReact NativeRustWebWindows
AndroidC.NETFlutteriOSJavaNode.jsPythonReactReact NativeRustWeb
SummaryPicovoice LeopardAmazon TranscribeAzure Speech-to-TextGoogle ASRGoogle ASR (Enhanced)IBM Watson Speech-to-TextWhisper Speech-to-Text
FAQ
Introduction
AndroidC.NETFlutteriOSJavaLinuxmacOSNode.jsPythonRaspberry PiReactReact NativeRustWebWindows
AndroidC.NETFlutteriOSJavaNode.jsPythonReactReact NativeRustWeb
SummaryPicovoice Cheetah
FAQ
Introduction
AndroidC.NETiOSLinuxmacOSNode.jsPythonRaspberry PiWebWindows
AndroidC.NETiOSNode.jsPythonWeb
SummaryAmazon PollyAzure TTSElevenLabsOpenAI TTSPicovoice Orca
Introduction
AndroidCiOSLinuxmacOSPythonRaspberry PiWebWindows
AndroidCiOSPythonWeb
SummaryPicovoice KoalaMozilla RNNoise
Introduction
AndroidCiOSLinuxmacOSNode.jsPythonRaspberry PiWebWindows
AndroidCNode.jsPythoniOSWeb
SummaryPicovoice EaglepyannoteSpeechBrainWeSpeaker
Introduction
AndroidCiOSLinuxmacOSPythonRaspberry PiWebWindows
AndroidCiOSPythonWeb
SummaryPicovoice FalconAmazon TranscribeAzure Speech-to-TextGoogle Speech-to-Textpyannote
Introduction
AndroidArduinoCChrome.NETEdgeFirefoxFlutteriOSJavaLinuxmacOSMicrocontrollerNode.jsPythonRaspberry PiReactReact NativeRustSafariUnityWebWindows
AndroidC.NETFlutteriOSJavaMicrocontrollerNode.jsPythonReactReact NativeRustUnityWeb
SummaryPorcupineSnowboyPocketSphinx
Wake Word TipsFAQ
Introduction
AndroidCChrome.NETEdgeFirefoxFlutteriOSJavaLinuxmacOSNode.jsPythonRaspberry PiReactReact NativeRustSafariUnityWebWindows
AndroidC.NETFlutteriOSJavaNode.jsPythonReactReact NativeRustUnityWeb
SummaryPicovoice RhinoGoogle DialogflowAmazon LexIBM WatsonMicrosoft LUIS
Expression SyntaxFAQ
Introduction
AndroidC.NETiOSLinuxmacOSNode.jsPythonRaspberry PiRustWebWindows
AndroidC.NETiOSNode.jsPythonRustWeb
SummaryPicovoice CobraWebRTC VAD
FAQ
Introduction
AndroidC.NETFlutteriOSNode.jsPythonReact NativeRustUnityWeb
AndroidC.NETFlutteriOSNode.jsPythonReact NativeRustUnityWeb
Introduction
C.NETNode.jsPython
C.NETNode.jsPython
FAQGlossary

Orca Streaming Text-to-Speech Engine

Orca is an on-device streaming Text-to-Speech engine. Orca is:

  • Private; All voice processing runs locally.
  • Compact and Computationally-Efficient
  • Optimized for LLMs, enabling low-latency voice assistants.
  • Cross-Platform:
    • Linux (x86_64)
    • macOS (x86_64, arm64)
    • Windows (x86_64, arm64)
    • Android
    • iOS
    • Web
    • Raspberry Pi (3, 4, 5)
EnglishEnglish
FrenchFrançais
GermanDeutsch
ItalianItaliano
Japanese日本語
Korean한국어
PortuguesePortuguês
SpanishEspañol

Get Started

Anyone who is using Picovoice needs to have a valid AccessKey. AccessKey is your authentication and authorization token for using Picovoice. It also verifies that your usage is within the limits of your account. You must keep your AccessKey secret!

Sign up for Picovoice Console

Sign up for Picovoice Console. It is free, no credit card required.

Retrieve AccessKey

Log in to your account to get your AccessKey.

Download SDK

Picovoice SDKs are available both on GitHub and via SDK-specific package managers. Follow one of the quick starts to synthesize audio offline using Orca with your newly-created AccessKey.

Voices

Orca Streaming Text-to-Speech can synthesize speech with various voices, each of which is characterized by a model file. Currently, Orca Streaming Text-to-Speech offers default 'female' and 'male' voice models. The model files can be downloaded from the Orca GitHub repository. To synthesize speech with a specific voice, provide the associated model file as an argument to the orca init functions.

Custom Voices

For custom voices in any language, Enterprise Plan customers can engage with Picovoice Consulting.

Speech control

Orca Streaming Text-to-Speech provides a set of parameters to control the synthesized speech. While the default speech rate is 1, values between 0.7 and 1.3 can be used to generate faster or slower speech.

Custom Pronunciation

Orca Streaming Text-to-Speech supports custom pronunciation which can be expressed in ARPAbet format for English and different subsets of IPA for our other languages.

English

Vowels

ARPAbet
Example
AA
balm
AE
cat
AH
sun
AO
dog
AW
bout
AY
my
EH
red
ER
her
EY
they
IH
sit
IY
see
OW
boat
OY
toy
UH
book
UW
food

Consonants

ARPAbet
Example
B
book
CH
chair
D
dog
DH
this
F
fan
G
go
HH
hat
JH
jump
K
kite
L
lamp
M
map
N
net
NG
ring
P
pen
R
run
S
sun
SH
ship
T
top
TH
thin
V
van
W
wet
Y
yes
Z
zoo
ZH
measure

French

Vowels

IPA
Example
a
patte
ã
sans
e
et
ɛ
bète
ɛ̃
vin
ə
je
i
vie
o
rose
õ
non
ɔ
sort
u
lune
y
lut
ø
peur
œ
sœur
œ̃
parfum
j
fief
w
oui
ɥ
nuit

Consonants

IPA
Example
b
bon
d
deux
f
faire
g
longue
k
carte
l
lire
m
main
n
nom
ɲ
mignon
ŋ
camping
p
pain
ʁ
rue
s
soleil
ʃ
chien
t
temps
v
ville
z
mais
ʒ
jour

German

Vowels

IPA
Example
a
Papa
aɪ
Bein
aʊ
Haus
ã
Chance
e
geben
i
mit
o
tot
õ
Balkon
u
Mut
y
Psychologie
ø
schön
ø̃
Parfum
œ
Söhne
ɔ
kommen
ɔi
flugzeug
ə
bitte
ɛ
Ende
ɛː
Bär
ɛ̃
Pointe
ɐ
immer
ɪ
mit
ʊ
Buch
ʏ
müssen

Consonants

IPA
Example
b
Buch
x
Buch
ç
ich
d
Dach
f
Fisch
g
Garten
h
Haus
j
Ja
k
Katze
l
Lampe
l̩
bittl
m
Mutter
m̩
großem
n
Nase
n̩
bitten
ŋ
klingt
p
Person
pf
Pfeffer
r
Rose
z
Zahl
s
Hase
ʃ
Schule
t
Tag
ʧ
Cello
v
Vater
ʦ
Platz
ʤ
Dschungel
ʒ
Genie
w
Wasser

Italian

Vowels

IPA
Example
a
casa
e
pera
ɛ
bèllo
i
filo
o
sole
ɔ
còsa
u
luce
j
piano
w
guaio

Consonants

IPA
Example
b
bene
d
dire
dz
mezzo
dʣ
azzurro
dʒ
gelo
ddʒ
maggio
f
fiore
g
gatto
k
cosa
l
luce
ʎ
famiglia
m
mano
n
nave
ŋ
banca
ɲ
bagno
ŋg
angolo
ŋk
ancora
p
pane
r
rosa
rr
terra
s
sole
ʃ
scena
t
tempo
ts
razza
tʃ
cibo
tʧ
feccia
tts
tazza
v
vento
z
azzurro

Japanese

Vowels

IPA
Example
a
aru (ある)
e
eki (えき)
i
iru (いる)
o
oni (おに)
ɯ
unagi (うなぎ)

Consonants

IPA
Example
b
basho (ばしょ)
bb
babbān (バッバーン)
bj
byōki (びょうき)
ç
hito (ひと)
ɕ
shita (した)
ɕɕ
hasshak (ハッシャク)
d
dōmo (どうも)
ʣ
zazen (ざぜん)
ʣʣ
kizzu (キッズ)
dd
beddo (ベッド)
dj
De~yuetto (デュエット)
f
fuyajou (フヤジョウ)
ɡ
gakkō (がっこう)
ɡj
kigyō (きぎょう)
h
hon (ほん)
hh
kohheru (コッヘル)
j
yuzu (ゆず)
k
kuru (くる)
kj
kyōkai (きょうかい)
kk
gekkō (ゲッコウ)
kkj
tokkyo (トッキョ)
m
mikan (みかん)
mj
myaku (みゃく)
n
nattō (なっとう)
ɲ
niwa (にわ)
ɴ
nihon (にほん)
p
pan (パン)
pj
happyō (はっぴょう)
pp
happī (ハッピー)
ppj
toppyoushi (トッピョウシ)
ɾ
roku (ろく)
ɾj
ryōri (りょうり)
s
suru (する)
ss
shissou (シッソウ)
t
taberu (たべる)
tt
setta (セッタ)
ʦ
tsunami (つなみ)
ʦʦ
tetsui (テッツイ)
w
warudakumi (ワルダクミ)
ʤ
dugong (ジュゴン)
ʤʤ
jajjame (ジャッジャメ)
ʧ
chizu (チズ)
ʧʧ
ottchan (オッチャン)

Korean

Vowels

IPA
Example
a
agi (아기)
ʌ
eomeoni (어머니)
ɛ
aegyo (애교)
e
enuli (에누리)
i
ibal (이발)
o
oli (오리)
ø
oegug (외국)
u
usan (우산)
wa
wau (와우)
we
weding (웨딩)
wi
wigi (위기)
wɛ
wae (왜)
wʌ
wonang (워낭)
ɯ
deudieo (드디어)
ɯi
uisa (의사)
jɛ
yaegi (예기)
jʌ
yeogi (여기)
ja
jagu (야구)
je
yesul (예술)
jo
yoli (요리)
ju
yuzu (유자)

Consonants

IPA
Example
b
sibi (시비)
bb
ppeokkugi (뻐꾸기)
d
gido (기도)
dd
ttalgi (딸기)
ʣ
gija (기자)
g
jigu (지구)
gg
kkoma (꼬마)
h
hyogwa (효과)
k
gudu (구두)
kʰ
kadeu (카드)
l
lamyeon (라면)
m
megi (메기)
n
namu (나무)
ŋ
jang-gi (장기)
p
jib (집)
pʰ
podo (포도)
r
goli (고리)
s
sagwa (사과)
ss
ssaum (싸움)
t
badchim (받침)
tʰ
toyoil (토요일)
ʨ
jeoul (저울)
ʨʨ
jjukkumi (쭈꾸미)
ʨʰ
chigwa (치과)
x
ihae (이해)

Portuguese

Vowels

IPA
Example
e
meto
ẽ
incenso
i
si
ĩ
assim
o
bola
õ
bom
u
luz
ũ
mundo
ɐ
papa
ɐ̃
mãe
ɔ
pór
ɛ
médio

Consonants

IPA
Example
b
abacaxi
d
cada
dʒ
jaze
f
faca
ɡ
água
j
aire
k
caça
l
alma
m
amor
n
banho
p
papel
ɾ
para
ʁ
carro
s
casa
t
ato
tʃ
chava
v
avó
w
quando
z
azul
ɲ
canha
ʃ
ache
ʎ
filho
ʒ
ajuda

Spanish

Vowels

IPA
Example
a
casa
e
leche
i
silla
o
sol
u
muro
j
rey
w
huevo

Consonants

IPA
Example
b
beso
β
obra
d
día
ð
adeptar
f
frío
g
gato
ɣ
agua
ʝ
ayuda
k
tako
l
lago
m
mano
ɱ
énfasis
n
noche
ɲ
señor
ŋ
banco
p
pan
r
rosa
ɾ
cara
s
sol
θ
cielo
t
taza
ʧ
chico
x
México
ʎ
calle
z
mismo

Emotional Control

Custom Orca Streaming Text-to-Speech models generate voices with emotions and styles, including joy, anger, whispering, and shouting. Enterprise Plan customers can engage with Picovoice Consulting for emotional control.

Was this doc helpful?

Issue with this doc?

Report a GitHub Issue
Orca Streaming Text-to-Speech Engine
  • Get Started
  • Sign up for Picovoice Console
  • Retrieve AccessKey
  • Download SDK
  • Voices
  • Custom Voices
  • Speech control
  • Custom Pronunciation
  • English
  • French
  • German
  • Italian
  • Japanese
  • Korean
  • Portuguese
  • Spanish
  • Emotional Control
Voice AI
  • Leopard Speech-to-Text
  • Cheetah Streaming Speech-to-Text
  • Orca Text-to-Speech
  • Koala Noise Suppression
  • Eagle Speaker Recognition
  • Falcon Speaker Diarization
  • Porcupine Wake Word
  • Rhino Speech-to-Intent
  • Cobra Voice Activity Detection
Local LLM
  • picoLLM Inference
  • picoLLM Compression
  • picoLLM GYM
Resources
  • Docs
  • Console
  • Blog
  • Use Cases
  • Playground
Sales & Services
  • Consulting
  • Foundation Plan
  • Enterprise Plan
  • Enterprise Support
Company
  • About us
  • Careers
Follow Picovoice
  • LinkedIn
  • GitHub
  • X
  • YouTube
  • AngelList
Subscribe to our newsletter
Terms of Use
Privacy Policy
© 2019-2025 Picovoice Inc.