Rhino Speech-to-Intent
Android API
API Reference for the Android Rhino SDK (rhino-android)
package: ai.picovoice.rhino
Rhino
Class for the Rhino Speech-to-Intent engine.
Rhino
can be initialized using the Rhino.Builder
Class. Resources should be cleaned
when you are done using the delete()
function.
Rhino.delete()
Releases resources acquired by Rhino
.
Rhino.getFrameLength()
Getter for number of audio samples per frame.
Returns
int
: Number of audio samples per frame.
Rhino.getSampleRate()
Getter for audio sample rate accepted by Picovoice.
Returns
int
: Audio sample rate accepted by Picovoice.
Rhino.getVersion()
Getter for version.
Returns
String
: CurrentRhino
version.
Rhino.getContextInformation()
Getter for context information.
Returns
String
: Returns the context information.
Rhino.process()
Processes a frame of the incoming audio stream and emits a flag indicating if the inference is finalized. The number of
samples per frame can be attained by calling getFrameLength()
. The incoming audio needs to
have a sample rate equal to getSampleRate()
and be 16-bit linearly-encoded. Rhino operates on
single-channel audio.
Parameters
pcm
short[] : A frame of audio samples.
Returns
boolean
: Flag indicating whether the engine has finalized intent inference.
Throws
RhinoException
: If there is an error while processing the audio frame.
Rhino.reset()
Resets the internal state of Rhino
. It should be called before the engine can be used to infer intent from a new stream of audio.
Throws
RhinoException
: If reset fails.
Rhino.getInference()
Gets inference result. If the spoken command was understood, it includes the specific intent name that was inferred,
and (if applicable) slot keys and specific slot values. Should only be called after the process()
function returns true
, otherwise Rhino has not yet reached an inference conclusion.
Returns
RhinoInference
: The result of inference.
Throws
RhinoException
: If inference retrieval fails.
Rhino.Builder
Builder for creating an instance of Rhino
with a mixture of default arguments.
Rhino.Builder.build()
Creates an instance of Rhino
Speech-to-Intent engine.
Returns
Rhino
: An instance of Rhino Speech-to-Intent engine.
Throws
RhinoException
: If an error occurs while creating an instance of Rhino Speech-to-Intent engine.
Rhino.Builder.setAccessKey()
Sets the AccessKey of the builder.
Parameters
accessKey
String : AccessKey obtained from Picovoice Console.
Returns
Rhino.Builder
: Modified Rhino.Builder object.
Rhino.Builder.setContextPath()
Sets the context path of the builder.
Parameters
contextPath
String : Path to the context file (.rhn
). Can be either a path that is relative to the project'sassets
folder or an absolute path to the file on device.
Returns
Rhino.Builder
: Modified Rhino.Builder object.
Rhino.Builder.setModelPath()
Sets the model path of the builder. If not set it will be set to the default location.
Parameters
modelPath
String : Path to the file containing model parameters (.pv
). Can be either a path that is relative to the project'sassets
folder or an absolute path to the file on device.
Returns
Rhino.Builder
: Modified Rhino.Builder object.
Rhino.Builder.setSensitivity()
Sets sensitivity of the builder. Value should be a number within [0, 1]. A higher sensitivity results in fewer misses at the cost of increasing the false alarm rate.
Parameters
sensitivity
float: Sensitivity for inference.
Returns
Rhino.Builder
: Modified Rhino.Builder object.
Rhino.Builder.setEndpointDurationSec()
Sets the endpoint duration in seconds. An endpoint is a chunk of silence at the end of an utterance that marks the end of spoken command. It should be a positive number within [0.5, 5]. A lower endpoint duration reduces delay and improves responsiveness. A higher endpoint duration ensures Rhino doesn't return inference pre-emptively in case the user pauses before finishing the request.
Parameters
endpointDurationSec
float: Endpoint duration in seconds.
Returns
Rhino.Builder
: Modified Rhino.Builder object.
Rhino.Builder.setRequireEndpoint()
Sets requireEndpoint
of the Builder. If set to true
, Rhino requires an endpoint (a chunk of silence) after the
spoken command. If set to false
, Rhino tries to detect silence, but if it cannot, it still will provide inference
regardless. Set to false
only if operating in an environment with overlapping speech (e.g. people talking in the
background).
Parameters
requireEndpoint
boolean: Indicates whether an endpoint is required for Rhino to finish inference.
Returns
Rhino.Builder
: Modified Rhino.Builder object.
RhinoInference
Class that contains Rhino inference data.
RhinoInference.getIsUnderstood()
Getter for the isUnderstood
flag.
Returns
boolean
: Iftrue
,Rhino
understood the inference.
RhinoInference.getIntent()
Getter for the inference intent name.
Returns
String
: Inference intent name.
RhinoInference.getSlots()
Getter for the inference slots and values.
Returns
Map<String, String>
: Map for inference slots and values.
RhinoException
Exception thrown if an error occurs within the Rhino
engine.
Exceptions:
RhinoManager
High-level Android binding for Rhino
Speech-to-Intent engine. It handles recording audio from microphone,
processes it in real-time using Rhino, and notifies the client when an intent is inferred from the spoken command.
RhinoManager.delete()
Releases resources acquired by RhinoManager
.
RhinoManager.getVersion()
Getter for version.
Returns
String
: CurrentRhino
version.
RhinoManager.process()
Start recording audio from the microphone and infers the user's intent from the spoken command. Once the inference is finalized it will invoke the user provided callback and terminate recording.
Throws
RhinoException
: If an error occurs while starting RhinoManager.
RhinoManager.getContextInformation()
Getter for context information.
Returns
String
: Returns the context information.
RhinoManager.Builder
Builder for creating an instance of RhinoManager
with a mixture of default arguments.
RhinoManager.Builder.build()
Creates an instance of RhinoManager
.
Parameters
context
Context : The Android app context.callback
RhinoManagerCallback : A callback function that is invoked when intent has been inferred. The callback must be an instance ofRhinoManagerCallback
.
Returns
RhinoManager
: An instance of RhinoManager.
Throws
RhinoException
: If an error occurs while creating an instance of RhinoManager.
RhinoManager.Builder.setAccessKey()
Sets the AccessKey of the builder.
Parameters
accessKey
String : AccessKey obtained from Picovoice Console.
Returns
RhinoManager.Builder
: The instance of RhinoManager.Builder object.
RhinoManager.Builder.setContextPath()
Sets the context path of the builder.
Parameters
contextPath
String : Path to the context file (.rhn
). Can be either a path that is relative to the project'sassets
folder or an absolute path to the file on device.
Returns
RhinoManager.Builder
: Modified RhinoManager.Builder object.
RhinoManager.Builder.setModelPath()
Sets the model path of the builder. If not set it will be set to the default location.
Parameters
modelPath
String : Path to the file containing model parameters (.pv
). Can be either a path that is relative to the project'sassets
folder or an absolute path to the file on device.
Returns
RhinoManager.Builder
: Modified RhinoManager.Builder object.
RhinoManager.Builder.setSensitivity()
Sets sensitivity of the builder. Value should be a number within [0, 1]. A higher sensitivity results in fewer misses at the cost of increasing the false alarm rate.
Parameters
sensitivity
float: Sensitivity for inference.
Returns
RhinoManager.Builder
: Modified RhinoManager.Builder object.
RhinoManager.Builder.setEndpointDurationSec()
Sets the endpoint duration in seconds. An endpoint is a chunk of silence at the end of an utterance that marks the end of spoken command. It should be a positive number within [0.5, 5]. A lower endpoint duration reduces delay and improves responsiveness. A higher endpoint duration ensures Rhino doesn't return inference pre-emptively in case the user pauses before finishing the request.
Parameters
endpointDurationSec
float: Endpoint duration in seconds.
Returns
RhinoManager.Builder
: Modified RhinoManager.Builder object.
RhinoManager.Builder.setRequireEndpoint()
Sets requireEndpoint
of the Builder. If set to true
, Rhino requires an endpoint (a chunk of silence) after the
spoken command. If set to false
, Rhino tries to detect silence, but if it cannot, it still will provide inference
regardless. Set to false
only if operating in an environment with overlapping speech (e.g. people talking in the
background).
Parameters
requireEndpoint
boolean: Indicates whether an endpoint is required for Rhino to finish inference.
Returns
RhinoManager.Builder
: Modified RhinoManager.Builder object.
RhinoManager.Builder.setErrorCallback()
Sets errorCallback of the builder.
Parameters
errorCallback
RhinoManagerErrorCallback : A function to run if errors occur while processing audio frames.
Returns
RhinoManager.Builder
: Modified RhinoManager.Builder object.
RhinoManagerCallback
Callback interface invoked when inference has been finalized.
RhinoManagerErrorCallback
Callback interface invoked when an error occurs while processing audio.