<div align="center"> <img src="https://storage.googleapis.com/hume-public-logos/hume/hume-banner.png"> <h1>@humeai/voice-react</h1> <p> <strong>Integrate Hume's Empathic Voice Interface in your React application</strong> </p> </div>
## Overview
This package streamlines all of the required state management for building client-side applications using the EVI Chat WebSocket through a `<VoiceProvider>` component and a `useVoice()` hook. It provides a WebSocket, Microphone Interface, Audio Playback Queue, and Message History that are all designed to work closely together.
> [!NOTE]
> This package uses Web APIs for microphone input and audio playback that are not compatible with React Native.
## Prerequisites
> [!IMPORTANT]
> This package is built for use within modern web-based React applications using a bundler like Next.js, Webpack, or Vite.
Before installing this package, please ensure your development environment meets the following requirement:

- Node.js (`v18.0.0` or higher)
To verify your Node.js version, run this command in your terminal:
```sh
node --version
```
If your Node.js version is below `18.0.0`, update it to meet the requirement. For updating Node.js, visit Node.js' official site or use a version management tool like nvm for a more seamless upgrade process.
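For example, with nvm installed, the following installs and switches to a compatible version:

```sh
nvm install 18   # install the latest Node.js 18.x release
nvm use 18       # switch the current shell session to it
```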
## Installation
Add `@humeai/voice-react` to your project by running this command in your project directory:

```sh
npm install @humeai/voice-react
```
This will download and include the package in your project, making it ready for import and use within your React components.
```tsx
import { VoiceProvider } from '@humeai/voice-react';
```
## Usage

### Quickstart
To use the SDK, wrap your components in the `VoiceProvider` component, which will enable your components to access available voice methods. Here's a simple example to get you started:
```tsx
import { VoiceProvider } from '@humeai/voice-react';

function App() {
  const apiKey = process.env.HUME_API_KEY;

  return (
    <VoiceProvider
      auth={{ type: 'apiKey', value: apiKey }}
      configId={/* Optional: Your EVI Configuration ID */}
    >
      {/* ... */}
    </VoiceProvider>
  );
}
```
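If you prefer not to expose an API key in the browser, the `auth` prop also accepts an access token (see `auth` under "Configuring VoiceProvider" below). Here is a minimal sketch, assuming a hypothetical `/api/hume-token` endpoint that you host and that mints the token server-side; the endpoint is not part of this package:

```tsx
import { useEffect, useState } from 'react';
import { VoiceProvider } from '@humeai/voice-react';

function App() {
  const [accessToken, setAccessToken] = useState<string | null>(null);

  useEffect(() => {
    // Hypothetical endpoint you control; minting the token server-side
    // keeps your Hume API key out of the browser.
    fetch('/api/hume-token')
      .then((res) => res.json())
      .then((data) => setAccessToken(data.accessToken));
  }, []);

  if (!accessToken) return null;

  return (
    <VoiceProvider auth={{ type: 'accessToken', value: accessToken }}>
      {/* ... */}
    </VoiceProvider>
  );
}
```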
### Configuring VoiceProvider
See a complete list of props accepted by `VoiceProvider` below; a sketch that wires several of these props together follows the list:
- `auth: { value: string; type: "apiKey" } | { value: string; type: "accessToken" }`
  (Required) Authentication strategy and corresponding value. Authentication is required to establish the WebSocket connection with Hume's Voice API. See our documentation on obtaining your API key or access token.
- `hostname?: string`
  (Optional) Hostname of the Hume API. If not provided, this value will default to `"api.hume.ai"`.
- `reconnectAttempts?: number`
  (Optional) Number of times to attempt to reconnect to the API. If not provided, this value will default to `30`.
- `debug?: boolean`
  (Optional) Enable debug mode. If not provided, this value will default to `false`.
- `configId?: string`
  (Optional) If you have a configuration ID with voice presets, pass the config ID here.
- `configVersion?: string`
  (Optional) If you wish to use a specific version of your config, pass in the version ID here.
- `verboseTranscription?: boolean`
  (Optional) A flag to enable verbose transcription. When `true`, unfinalized user transcripts are sent to the client as interim `UserMessage` messages, which makes the assistant more sensitive to interruptions. Defaults to `true`.
- `onMessage?: (message: JsonMessage & { receivedAt: Date }) => void`
  (Optional) Callback function to invoke upon receiving a message through the WebSocket.
- `onToolCall?: ToolCallHandler`
  (Optional) Callback function to invoke upon receiving a `ToolCallMessage` through the WebSocket. The string it returns is sent back as the content of a `ToolResponseMessage`. This is where you should add logic that handles your custom tool calls.
- `onAudioReceived?: (message: AudioOutputMessage) => void`
  (Optional) Callback function to invoke when an audio output message is received from the WebSocket.
- `onAudioStart?: (clipId: string) => void`
  (Optional) Callback function to invoke when an audio clip from the assistant starts playing.
- `onAudioEnd?: (clipId: string) => void`
  (Optional) Callback function to invoke when an audio clip from the assistant stops playing.
- `onInterruption?: (clipId: string) => void`
  (Optional) Callback function to invoke when the assistant is interrupted.
- `onClose?: (event: CloseEvent) => void`
  (Optional) Callback function to invoke upon the WebSocket connection being closed.
- `clearMessagesOnDisconnect?: boolean`
  (Optional) Boolean which indicates whether you want to clear message history when the call ends.
- `messageHistoryLimit?: number`
  (Optional) Set the number of messages that you wish to keep over the course of the conversation. The default value is `100`.
- `sessionSettings?: SessionSettings`
  (Optional) Settings object for providing custom values for the session.
- `resumedGroupChatId?: string`
  (Optional) Include a chat group ID, which enables the chat to continue from a previous chat group.
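For instance, a provider that logs incoming messages and tracks audio playback might look like the following sketch. The config ID is a placeholder and the handler bodies are illustrative only:

```tsx
import { VoiceProvider } from '@humeai/voice-react';

function App() {
  return (
    <VoiceProvider
      auth={{ type: 'apiKey', value: process.env.HUME_API_KEY }}
      configId="your-config-id" // placeholder: your EVI configuration ID
      messageHistoryLimit={50}
      onMessage={(message) => {
        // Every socket message arrives here, stamped with receivedAt.
        console.log('received at', message.receivedAt, message);
      }}
      onAudioStart={(clipId) => console.log('playing clip', clipId)}
      onAudioEnd={(clipId) => console.log('finished clip', clipId)}
      onClose={(event) => console.log('socket closed with code', event.code)}
    >
      {/* ... */}
    </VoiceProvider>
  );
}
```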
### Using the Voice

After you have set up your voice provider, you will be able to access various properties and methods to use the voice in your application. In any component that is a child of `VoiceProvider`, access these methods by importing the `useVoice` custom hook.
For example, to include a button to start a call, you could create a button like this:
```tsx
import { useVoice } from '@humeai/voice-react';

export function StartCallButton() {
  const { connect } = useVoice();

  return <button onClick={() => connect()}>Start Call</button>;
}
```
> [!IMPORTANT]
> Under the hood, the React SDK uses the `AudioContext` API, which must be initialized by a user gesture.
>
> :white_check_mark: CORRECT: call `connect` on a button click.
>
> :x: INCORRECT: call `connect` in a `useEffect` to start a call on component mount.
#### Methods
- `connect: () => Promise`
  Opens a socket connection to the voice API and initializes the microphone.
- `disconnect: () => void`
  Disconnect from the voice API and microphone.
- `clearMessages: () => void`
  Clear transcript messages from history.
- `mute: () => void`
  Mute the microphone.
- `unmute: () => void`
  Unmute the microphone.
- `muteAudio: () => void`
  Mute the assistant audio.
- `unmuteAudio: () => void`
  Unmute the assistant audio.
- `sendSessionSettings: (message: SessionSettings) => void`
  Send new session settings to the assistant. This overrides any session settings that were passed as props to the `VoiceProvider`.
- `sendUserInput: (text: string) => void`
  Send a user input message.
- `sendAssistantInput: (text: string) => void`
  Send a text string for the assistant to read out loud.
- `sendToolMessage: (toolMessage: ToolResponse | ToolError) => void`
  Send a tool response or tool error message to the EVI backend.
- `pauseAssistant: () => void`
  Pauses responses from EVI. Chat history is still saved and sent after resuming.
- `resumeAssistant: () => void`
  Resumes responses from EVI. Chat history sent while paused will now be sent.
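As an example, a minimal set of in-call controls built from these methods might look like the following sketch; the button labels and sample input text are illustrative:

```tsx
import { useVoice } from '@humeai/voice-react';

export function CallControls() {
  const { disconnect, mute, unmute, isMuted, sendUserInput } = useVoice();

  return (
    <div>
      {/* Toggle the microphone using mute/unmute and the isMuted property */}
      <button onClick={() => (isMuted ? unmute() : mute())}>
        {isMuted ? 'Unmute' : 'Mute'}
      </button>
      {/* Send a text message as user input */}
      <button onClick={() => sendUserInput('Tell me a joke.')}>
        Send text input
      </button>
      {/* Close the socket and release the microphone */}
      <button onClick={() => disconnect()}>End Call</button>
    </div>
  );
}
```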
#### Properties
- `isMuted: boolean`
  Boolean that describes whether the microphone is muted.
- `isAudioMuted: boolean`
  Boolean that describes whether the assistant audio is muted.
- `isPlaying: boolean`
  Describes whether the assistant audio is currently playing.
- `isPaused: boolean`
  Boolean that describes whether the assistant is paused. When paused, the assistant will still be listening, but will not send a response until it is resumed.
- `fft: number[]`
  Audio FFT values for the assistant audio output.
- `micFft: number[]`
  Audio FFT values for microphone input.
- `messages: (UserTranscriptMessage | AssistantTranscriptMessage | ConnectionMessage | UserInterruptionMessage | JSONErrorMessage)[]`
  Message history of the current conversation. By default, `messages` does not include interim user messages when `verboseTranscription` is set to `true` on the `VoiceProvider` (`verboseTranscription` is `true` by default). To access interim messages, you can define a custom `onMessage` callback on your `VoiceProvider`.
- `lastVoiceMessage: AssistantTranscriptMessage | null`
  The last transcript message received from the assistant.
- `lastUserMessage: UserTranscriptMessage | null`
  The last transcript message received from the user.
- `readyState: VoiceReadyState`
  The current `readyState` of the WebSocket connection.
- `status: VoiceStatus`
  The current status of the voice connection. Informs you of whether the voice is connected, disconnected, connecting, or in an error state. If the voice is in an error state, it will automatically disconnect from the WebSocket and microphone.
- `error: VoiceError`
  Provides more detailed error information if the voice is in an error state.
- `isError: boolean`
  If true, the voice is in an error state.
- `isAudioError: boolean`
  If true, an audio playback error has occurred.
- `isMicrophoneError: boolean`
  If true, a microphone error has occurred.
- `isSocketError: boolean`
  If true, there was an error connecting to the WebSocket.
- `callDurationTimestamp: string | null`
  The length of a call. This value persists after the conversation has ended.
- `toolStatusStore: Record<string, { call?: ToolCall; resolved?: ToolResponse | ToolError }>`
  A map of tool call IDs to their associated tool messages.
- `chatMetadata: ChatMetadataMessage | null`
  Metadata about the current chat, including chat ID, chat group ID, and request ID.
- `playerQueueLength: number`
  The number of assistant audio clips that are queued up, including the clip that is currently playing.
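For example, a transcript view can be rendered from `messages`. This sketch assumes user and assistant transcript messages carry `type` values of `'user_message'` and `'assistant_message'` and a nested `message` object with `role` and `content` fields, per Hume's message types:

```tsx
import { useVoice } from '@humeai/voice-react';

export function Transcript() {
  const { messages } = useVoice();

  return (
    <div>
      {messages.map((msg, index) => {
        // Only user and assistant transcript messages carry text content;
        // connection and interruption messages are skipped here.
        if (msg.type === 'user_message' || msg.type === 'assistant_message') {
          return (
            <div key={index}>
              <strong>{msg.message.role}</strong>: {msg.message.content}
            </div>
          );
        }
        return null;
      })}
    </div>
  );
}
```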
## Support

If you have questions or require assistance pertaining to this package, reach out to us on Discord!