coquisttjs
v0.1.1
Published
Coqui STT runtime transcript NodeJs client, including server usage examples.
Downloads
11
Maintainers
Readme
CoquiSTTJs
Coqui STT runtime transcript NodeJs client, including multithread (server) usage examples.
What's Coqui STT?
Coqui STT on github is an open-source Speech-To-Text engine, project fork of Mozilla DeepSpeech. Documentation for installation, usage, and training models are available on Coqui STT documentation.
Now, you want to access Coqui STT speech to text transcription, from nodeJs. Coqui STT official native NodeJs API:
✨ Coqui STT run-time decoding, using NodeJs
This project supplies to nodeJs developers a simple API interface on top of the Coqui STT native NodeJs binding. Here the current API javascript endpoints:
loadModel(modelFile, scorerFile)
transcriptBuffer(audioBuffer, model)
transcriptFile(audioFile, model)
freeModel(model)
📦 Install
Install Coqui STT engine, models, audio
# Create a virtual environment
$ python3 -m venv venv-stt
$ source venv-stt/bin/activate
# Install 🐸STT
$ python3 -m pip install -U pip
$ python3 -m pip install stt
# Download 🐸's pre-trained English models
$ curl -LO https://github.com/coqui-ai/STT/releases/download/v0.9.3/coqui-stt-0.9.3-models.pbmm
$ curl -LO https://github.com/coqui-ai/STT/releases/download/v0.9.3/coqui-stt-0.9.3-models.scorer
# Download some example audio files
$ curl -LO https://github.com/coqui-ai/STT/releases/download/v0.9.3/audio-0.9.3.tar.gz
$ tar -xvf audio-0.9.3.tar.gz
# Transcribe an audio file
$ stt --model coqui-stt-0.9.3-models.pbmm --scorer coqui-stt-0.9.3-models.scorer --audio audio/2830-3980-0043.wav
Install CoquiSTTJs
npm install coquisttjs
🛠 Usage examples and tests
Some usage examples and tests. Examples include:
- Simple sentence-based speech-to-text
coquihttp
A simple Speech-to-text HTTP Servercoquihttp
as RHASSPY speech-to-text remote HTTP Server- SocketIO server pseudocode
🎁 Bonus tracks
audioutils.js
fast transcoding to PCM, using ffmpeg process (install ffmpeg before).
🧶 Status
- The project is in a very draft stage.
- Current Coqui STT npm package solve the DeepSpeech
issue.
In facts current npm package
deepspeech
cause a crash using node version 16.0.0. BTW I had success using DeeSpeech npm package with Node version 14.16.1. See my project DeepSpeechJs. - Complete the high-level API interface. E.g. including metadata as parameters
How to contribute
If you like the project, please ⭐️ star this repository to show your support! 🙏
Any contribute is welcome.
- Discussions. Please open a new discussion (a publich chat on github) for any specific open topic, for a clarification, change request proposals, etc.
- Issues Please submit issues for bugs, etc
- e-mail You can contact me privately, via email.
License
MIT (c) Giorgio Robino