sofya.transcription

v0.0.10


developersofyaai

Speech to Text Transcription audio capture stream MediaStream

Sofya Transcription

Sofya Transcription is a JavaScript library for real-time audio transcription. It transcribes audio streams and is designed to integrate easily into web applications. The library also includes functionality for capturing audio from media elements.

Features

Real-Time Transcription: Transcribe audio streams in real time with high accuracy.
Flexible Integration: Seamlessly integrates with your web applications.
Media Element Audio Capture: Capture audio from media elements such as <video> and <audio>.

Installation

To install Sofya Transcription, you can use npm:

npm install sofya.transcription

Usage

Here's a basic example of how to use Sofya Transcription in your project:

Import the Library:
import { MediaElementAudioCapture, TranscriptionService } from 'sofya.transcription';
Create a Transcription Service Instance:
const transcriptionConfig = { language: 'pt-BR', model: 'YOUR_MODEL_ID', region: 'YOUR_REGION' };
const transcriber = new TranscriptionService('YOUR_API_KEY', transcriptionConfig);
Initialize and Start Transcription:
// getUserMedia requires a constraints object; request audio only
const mediaStream = await navigator.mediaDevices.getUserMedia({ audio: true });
transcriber.startTranscription(mediaStream);
Handle Transcription Events:
transcriber.on('recognizing', (text) => { console.log('Recognizing: ' + text); });
transcriber.on('recognized', (text) => { console.log('Recognized: ' + text); });
transcriber.on('nomatch', () => { console.log('No match found.'); });
Stop Transcription:
transcriber.stopTranscription();
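
The event handlers above only log text. As a minimal sketch of how the three events could be combined into one display string, the helper below keeps finalized text separate from the latest interim text. `TranscriptBuffer` and `joinText` are hypothetical names, not part of sofya.transcription, and the sketch assumes that each 'recognizing' event carries the full interim text of the current utterance and that results should be joined with a single space.

```typescript
// Hypothetical helper (not part of sofya.transcription): merges interim and
// final results into a single display string.

// Joins two pieces of text with one space, skipping empty pieces.
function joinText(a: string, b: string): string {
  if (a === "") return b;
  if (b === "") return a;
  return a + " " + b;
}

class TranscriptBuffer {
  private finalized = ""; // text committed by 'recognized' events
  private interim = "";   // latest text from 'recognizing' events

  // Call from the 'recognizing' handler: replaces the previous interim text.
  onRecognizing(text: string): string {
    this.interim = text;
    return this.current();
  }

  // Call from the 'recognized' handler: commits the text and clears the interim part.
  onRecognized(text: string): string {
    this.finalized = joinText(this.finalized, text);
    this.interim = "";
    return this.current();
  }

  // Call from the 'nomatch' handler: drops any stale interim text.
  onNoMatch(): string {
    this.interim = "";
    return this.current();
  }

  // Finalized text followed by the interim text, if any.
  current(): string {
    return joinText(this.finalized, this.interim);
  }
}
```

Each handler can then pass its text to the matching method, for example `transcriber.on('recognized', (text) => render(buffer.onRecognized(text)))`, where `render` stands for whatever updates your UI.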

API

`TranscriptionService`

constructor('YOUR_API_KEY', config): Creates a new instance of the transcription service with the given API key and a config object.
startTranscription(mediaStream): void: Starts the transcription process with a given MediaStream.
stopTranscription(): void: Stops the transcription process.
pauseTranscription(): void: Pauses the transcription process.
resumeTranscription(): void: Resumes the transcription process.
on(event: string, callback: Function): void: Registers an event handler for transcription events. Possible events include:
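- recognizing: Fired while transcription is in progress; the callback receives the interim text.
- recognized: Fired when transcription of the spoken audio is complete; the callback receives the recognized text.
- nomatch: Fired when no speech is recognized.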
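
When unit-testing application code that depends on this API, it can help to replace the real service with a stand-in that needs no API key or microphone. The sketch below is one possible test double that mirrors the method names listed above. `FakeTranscriber` and its `emit` method are hypothetical and not part of sofya.transcription. The rule that events are only delivered while the fake is running is an assumption of this sketch, not documented library behavior.

```typescript
type TranscriptionEvent = "recognizing" | "recognized" | "nomatch";
type TranscriptionHandler = (text: string) => void;

// Hypothetical stand-in that mirrors the documented method names.
// It performs no transcription; tests drive it through emit().
class FakeTranscriber {
  private handlers: { [event: string]: TranscriptionHandler[] } = {};
  private state: "idle" | "running" | "paused" = "idle";

  // Registers a handler; several handlers per event are allowed.
  on(event: TranscriptionEvent, callback: TranscriptionHandler): void {
    const list = this.handlers[event] || [];
    list.push(callback);
    this.handlers[event] = list;
  }

  // The stream argument is accepted but ignored by the fake.
  startTranscription(_mediaStream: unknown): void {
    this.state = "running";
  }

  stopTranscription(): void {
    this.state = "idle";
  }

  pauseTranscription(): void {
    if (this.state === "running") this.state = "paused";
  }

  resumeTranscription(): void {
    if (this.state === "paused") this.state = "running";
  }

  // Test-only: delivers an event to its handlers while running.
  // Returns false (and calls nothing) when idle or paused.
  emit(event: TranscriptionEvent, text: string = ""): boolean {
    if (this.state !== "running") return false;
    for (const handler of this.handlers[event] || []) {
      handler(text);
    }
    return true;
  }
}
```

A test can register the same handlers it would register on the real service, call `startTranscription`, and then use `emit('recognized', 'some text')` to simulate results.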

React Example

import React from 'react'
import { SofyaTranscriber } from 'sofya.transcription'
const App = () => {
  const transcriberRef = React.useRef<SofyaTranscriber | null>(null)
  const [transcription, setTranscription] = React.useState('')
  const transcriptionRef = React.useRef('')

  const getMediaStream = async () => {
    const stream = await navigator.mediaDevices.getUserMedia({ audio: true })
    return stream
  }

  const startTranscription = async () => {
    try {
      const stream = await getMediaStream()
      const transcriber = new SofyaTranscriber(
        'your_key',
        {
          language: 'pt-BR',
        }
      )

      transcriber.on("ready", () => {
        transcriber.startTranscription(stream)
      })
      // Interim result: show it after the finalized text without committing it
      transcriber.on('recognizing', (result: string) => {
        console.log({ recognizing: result })
        setTranscription(transcriptionRef.current + result)
      })

      // Final result: commit it to the ref so later results append to it
      transcriber.on('recognized', (result: string) => {
        console.log({ recognized: result })
        transcriptionRef.current += result
        setTranscription(transcriptionRef.current)
      })
      transcriberRef.current = transcriber
    } catch (error) {
      // For example, the user denied microphone access
      console.error('Failed to start transcription:', error)
    }
  }

  const stopTranscription = () => {
    if (transcriberRef.current) {
      transcriberRef.current.stopTranscription()
      transcriberRef.current = null
    }
  }

  return (
    <div>
      <button onClick={startTranscription}>Start Transcription</button>
      <button onClick={stopTranscription}>Stop Transcription</button>
      <hr />
      <p>{transcription}</p>
    </div>
  )
}

export default App

`MediaElementAudioCapture`

constructor(): Creates a new instance of the media element audio capture.
captureAudio(mediaElement: HTMLMediaElement): MediaStream: Captures the audio stream from a given media element and returns a MediaStream.
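
Since `captureAudio` returns a MediaStream and `startTranscription` accepts one, the two classes can be chained. The sketch below shows one way to wire them together. `transcribeElement` and the two interfaces are hypothetical names introduced for illustration. They describe only the documented methods used here and are generic so that the helper does not depend on browser types.

```typescript
// Structural types covering only the documented methods used by the helper.
interface AudioCaptureLike<E, S> {
  captureAudio(mediaElement: E): S;
}

interface StreamTranscriberLike<S> {
  startTranscription(mediaStream: S): void;
  stopTranscription(): void;
}

// Hypothetical helper: captures the element's audio, starts transcription
// on the resulting stream, and returns a function that stops transcription.
// The returned function is safe to call more than once.
function transcribeElement<E, S>(
  capture: AudioCaptureLike<E, S>,
  transcriber: StreamTranscriberLike<S>,
  mediaElement: E
): () => void {
  const stream = capture.captureAudio(mediaElement);
  transcriber.startTranscription(stream);

  let stopped = false;
  return () => {
    if (stopped) return;
    stopped = true;
    transcriber.stopTranscription();
  };
}
```

In a browser this might be called as `transcribeElement(new MediaElementAudioCapture(), transcriber, videoElement)`, assuming `transcriber` is an already configured service instance and `videoElement` is a `<video>` or `<audio>` element on the page.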
