@picovoice/rhino-node
v3.0.5
Published
Picovoice Rhino Node.js binding
Downloads
259
Readme
Rhino Binding for Node.js
Rhino
Rhino is Picovoice's Speech-to-Intent engine. It directly infers intent from spoken commands within a given context of interest, in real-time. For example, given a spoken command "Can I have a small double-shot espresso?", Rhino infers that the user wants to order a drink and emits the following inference result:
{
"type": "espresso",
"size": "small",
"numberOfShots": "2"
}
Unlike typical NLU inference software, Rhino does not use generic Speech-to-Text transcription, and instead operates on a compact, bespoke model generated for a specific use case; e.g. a coffee maker, or smart home lighting. Unless you deliberately program it to do so, it won't understand phrases like "tell me a joke". Using this approach (combined with Picovoice's proprietary deep learning technology) allows for:
- dramatically improved efficiency (it can even run on tiny microcontrollers)
- accuracy gains from not having to anticipate every possible spoken phrase
- avoiding transcription errors compounding into the intent understanding (e.g. homonyms are much less of an issue, because we probably know which word makes sense).
To learn more about Rhino, see the platform, documentation, and GitHub pages.
Creating a context
To design contexts and train into .rhn
files, see the Picovoice Console.
Files generated with the Picovoice Console carry restrictions including (but not limited to): training allowance, time limits, available platforms, and commercial usage.
Compatibility
This binding is for running Rhino on NodeJS 12+ on the following platforms:
- Windows (x86_64)
- Linux (x86_64)
- macOS (x86_64, arm64)
- Raspberry Pi (3, 4, 5)
Web Browsers
This binding is for Node.js and does not work in a browser. Looking to run Rhino in-browser? There are npm packages available for Web, and dedicated packages for Angular, React, and Vue.
AccessKey
Rhino requires a valid Picovoice AccessKey
at initialization. AccessKey
acts as your credentials when using Rhino SDKs.
You can get your AccessKey
for free. Make sure to keep your AccessKey
secret.
Signup or Login to Picovoice Console to get your AccessKey
.
Usage
The binding provides the Rhino class. Create instances of the Rhino class to make speech inferences within a context.
Quick Start
const {Rhino} = require("@picovoice/rhino-node");
const coffeeMakerContextPath = "./coffee_maker.rhn";
const accessKey = "${ACCESS_KEY}" // Obtained from the Picovoice Console (https://console.picovoice.ai/)
const handle = new Rhino(accessKey, coffeeMakerContextPath);
let isFinalized = false;
// process each frame of audio until Rhino has concluded that it understood the phrase (or did not)
// when Rhino has reached a conclusion, isFinalized will become true
while (!isFinalized) {
isFinalized = handle.process(frame);
// retrieve the inference from Rhino
if (isFinalized) {
const inference = handle.getInference();
// inference result example:
//
// {
// isUnderstood: true,
// intent: 'orderDrink',
// slots: {
// size: 'medium',
// numberOfShots: 'double shot',
// coffeeDrink: 'americano',
// milkAmount: 'lots of milk',
// sugarAmount: 'some sugar'
// }
// }
}
}
// always call release when finished, to free the resources allocated by Rhino
handle.release();
Override model and library paths
The Rhino constructor accepts three optional positional parameters for the sensitivity and the absolute paths to the model and dynamic library, should you need to override them (typically, you will not).
const accessKey = "${ACCESS_KEY}" // Obtained from the Picovoice Console (https://console.picovoice.ai/)
const handle = new Rhino(
accessKey,
contextPath,
sensitivity,
modelFilePath,
libraryFilePath
);
Using the bindings from source
Unit Tests
Run yarn
(or npm install --also=dev
) from the binding/nodejs directory to install project dependencies. This will also run a script to copy all of the necessary shared resources from the Rhino repository into the package directory.
Run yarn test
(or npm run test
) from the binding/nodejs directory to execute the test suite.