deepgram-conversational-demo
v0.4.0
Combine Text-to-Speech and Speech-to-Text into a conversational agent.
Deepgram AI Agent Technical Demo
Project codename EmilyAI
The purpose of this demo is to showcase how you can use Deepgram to build a Conversational AI application that engages users in natural language interactions that mimic human conversation.
Examples of where you would see this type of application include: virtual assistants for tasks like answering queries and controlling smart devices, educational tutors for personalized learning, healthcare advisors for medical information, and entertainment chatbots for engaging conversations and games.
These applications aim to enhance user experiences by offering efficient and intuitive interactions, reducing the need for human intervention in various tasks and services.
Issue Reporting
If you have found a bug or have a feature request, please report it in this repository's issues section. Please do not report security vulnerabilities on the public GitHub issue tracker.
Check out our KNOWN ISSUES before reporting.
Demo features
- Capture streaming audio using Deepgram Streaming Speech to Text.
- Natural Language responses using an OpenAI LLM.
- Text to Speech conversion using Deepgram Aura Text to Speech.
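The three features above chain into a single conversational turn: transcribe the user's audio, generate a reply, and synthesize that reply as speech. The following is a minimal sketch of that flow, not the demo's actual implementation. The `Stages` interface, the `runTurn` function, and the stage names `transcribe`, `complete`, and `speak` are all hypothetical; the caller is assumed to supply functions backed by Deepgram and OpenAI.

```typescript
// One conversational turn: audio in -> transcript -> LLM reply -> audio out.
// All names here are hypothetical and are not taken from the demo's source.

type Message = { role: "system" | "user" | "assistant"; content: string };

interface Stages {
  // Speech to Text, e.g. backed by Deepgram streaming transcription.
  transcribe(audio: Uint8Array): Promise<string>;
  // Language model, e.g. backed by an OpenAI chat model.
  complete(history: Message[]): Promise<string>;
  // Text to Speech, e.g. backed by Deepgram Aura.
  speak(text: string): Promise<Uint8Array>;
}

async function runTurn(
  stages: Stages,
  history: Message[],
  audio: Uint8Array,
): Promise<{ history: Message[]; reply: Uint8Array | null }> {
  const transcript = (await stages.transcribe(audio)).trim();

  // Nothing was said (silence or noise): leave the conversation unchanged.
  if (transcript === "") {
    return { history, reply: null };
  }

  // Append the user's words, then ask the model for a reply to the full history.
  const withUser: Message[] = [...history, { role: "user", content: transcript }];
  const answer = await stages.complete(withUser);

  // Convert the reply to audio and record it in the returned history.
  const reply = await stages.speak(answer);
  return {
    history: [...withUser, { role: "assistant", content: answer }],
    reply,
  };
}
```

Passing the stages in as an argument keeps the turn logic independent of any particular SDK, so each stage can be swapped or stubbed out in tests. The returned history is a new array, so the caller decides when to commit it.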
What is Deepgram?
Deepgram is a foundational AI company providing speech-to-text and language understanding capabilities to make data readable and actionable by humans or machines.
Sign up for Deepgram
Want to start building using this project? Sign up now for Deepgram and create an API key.
Quickstart
Manual
Follow these steps to get started with this starter application.
Clone the repository
Go to GitHub and clone the repository.
Install dependencies
Install the project dependencies.
npm install
Edit the config file
Copy the code from sample.env.local and create a new file called .env.local.
DEEPGRAM_STT_DOMAIN=https://api.deepgram.com
DEEPGRAM_API_KEY=YOUR-DG-API-KEY
OPENAI_API_KEY=YOUR-OPENAI-API-KEY
- For DEEPGRAM_API_KEY, paste in the key you generated in the Deepgram console.
- Set DEEPGRAM_STT_DOMAIN to be https://api.deepgram.com.
- OPENAI_API_KEY should be an OpenAI API Key that can access the chat completions API.
Run the application
Start the development server. Once it is running, you can access the application in your browser.
npm run dev
Getting Help
We love to hear from you, so if you have questions or comments, or find a bug in the project, let us know! You can:
- Open an issue in this repository
- Join the Deepgram GitHub Discussions Community
- Join the Deepgram Discord Community
Author
License
This project is licensed under the MIT license. See the LICENSE file for more info.