echo-e2e

v1.0.2

Published

a month ago

**Testing phone calls is hard. Testing AI-powered phone calls is even harder.**

Downloads

0High
0Medium
0Low

cheike

phone call twilio ngrok openai deepgram cartesia ai calls ai voice testing

Echo - An End-to-End Testing Framework for AI Phone Calls

Testing phone calls is hard. Testing AI-powered phone calls is even harder.

As the owner of Greetmate.ai, we faced challenges in doing quality assurance for our platform. Ensuring that phone calls were testable and consistently performed as expected was very important for us.

Echo simplifies this process by providing an end-to-end testing framework tailored for AI phone calls, enabling you to easily write, execute, and analyze tests.

What Does "End-to-End" Mean?

Echo offers a complete suite of tools out of the box, including:

Ngrok Integration: Seamlessly exposes your local server to the internet for testing.
Twilio Client: Uses Twilio to make and manage phone calls programmatically.
Speech-to-Text and Text-to-Speech Engines: Uses Deepgram and Cartesia for call interactions.

Live Test Output

Key Features

Simple, Intuitive Syntax: Write tests that are easy to understand and maintain.
Dynamic Response Handling: Automatically detect when the next transcript is ready and respond with speech.
Call Recordings: Capture and store recordings for detailed analysis.
Overlap Detection: Identify interruptions or overlapping dialogue.
Transcription Analysis: Transcribe audio recordings for further insights.
Post-Call AI Analysis: Leverage AI to evaluate the quality and content of conversations.

Installation

npm i echo-e2e dotenv

Configuration

To get started, configure your environment variables in a .env file:

# Required
DEEPGRAM_API_KEY=          # API key for Deepgram (speech-to-text)
OPENAI_API_KEY=            # API key for OpenAI (AI analysis)
TWILIO_ACCOUNT_SID=        # Twilio Account SID
TWILIO_AUTH_TOKEN=         # Twilio authentication token
TWILIO_FROM_NUMBER=        # Phone number to initiate calls
NGROK_AUTHTOKEN=           # Ngrok authentication token
CARTESIA_API_KEY=          # API key for Cartesia (text-to-speech)
VOICE_ID=                  # Voice ID for Cartesia

# Optional
MAX_CALL_LIFETIME_SECONDS=600  # Maximum call duration (default: 600 seconds)
TEST_TIMEOUT_MS=900000         # Test timeout in milliseconds (default: 900,000 ms)
EVALUATION_MODEL=gpt-4o        # AI model for post-call evaluation

Make sure you create a directory ./recordings with approriate write permissions.

Writing Tests

Tests should be placed in the ./tests directory. Here's a sample test to get you started:

import 'dotenv/config';
import {call} from 'echo';

await call(
    "The call intro is spoken", // Description of the test
    async ({call, assert}) => {
// Actions during the call
        await call.nextTranscript(); // Wait for the other agent's greeting
        await assert.assertLastTranscriptIncludes('Hello'); // Case insensitive

        await call.speak('Hello back! What is your purpose?');

        await call.nextTranscript();
        await assert.assertLastTranscriptIncludes('help you');
    },
    async ({call, transcribedText, assert}) => {
// Actions after the call (e.g., AI analysis)
        await assert.analyzeTranscribedTextWithAI(
            'The conversation is about the call agent offering their help.'
        );
    },
    {
        to: '+15553334444', // Number to call
    }
);

Running The Tests

 node node_modules/echo/dist/run.js

Optional parameters:

--only will run only the test with the given name (specialtest.test.js becomes specialtest)

Why Choose Echo?

Save time by automating the testing of complex AI-powered phone call workflows. Improve quality assurance by detecting errors and issues before deployment. Harness the power of AI to analyze conversations and provide actionable insights.

Contributing

We welcome contributions! If you’d like to contribute to Echo, please submit a pull request or create an issue to discuss your ideas.

License

Echo is licensed under MIT License.

Echo Function Reference

Below is a reference guide for the functions:

`call`

Description: The primary function for initiating a test call.

Syntax: await call(description, duringCall, afterCall, options);

Parameters:

description (string): A brief description of the test.
duringCall (async function): A callback function defining actions and assertions during the call.
afterCall (async function): A callback function for post-call analysis.
options (object): An optional configuration object for the test.

Options:

to (string): The phone number to call.

`call.nextTranscript`

Description: Waits for the next transcript from the other participant in the call.

Syntax: await call.nextTranscript();

Usage: Use this to wait for the other agent to finish speaking before continuing the test.

`call.speak`

Description: Sends a spoken response during the call using text-to-speech.

Syntax: await call.speak(message);

Parameters:

message (string): The text to be converted to speech and spoken during the call.

`assert.assertLastTranscriptIncludes`

Description: Asserts that the most recent transcript contains a specific phrase or word.

Syntax: await assert.assertLastTranscriptIncludes(expectedText);

Parameters:

expectedText (string): The text to check for in the last transcript.

`assert.analyzeTranscribedTextWithAI`

Description: Performs AI-based analysis on the transcribed text after the call is complete.

Syntax: await assert.analyzeTranscribedTextWithAI(expectedAnalysis);

Parameters:

expectedAnalysis (string): The expected conclusion or content of the conversation to compare against.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

Echo - An End-to-End Testing Framework for AI Phone Calls

What Does "End-to-End" Mean?

Key Features

Installation

Configuration

Writing Tests

Running The Tests

Why Choose Echo?

Contributing

License

Echo Function Reference

call

call.nextTranscript

call.speak

assert.assertLastTranscriptIncludes

assert.analyzeTranscribedTextWithAI

`call`

`call.nextTranscript`

`call.speak`

`assert.assertLastTranscriptIncludes`

`assert.analyzeTranscribedTextWithAI`