@capacitor-community/image-to-text

v7.0.0

Published

24 days ago

Image to Text (OCR) Plugin for Capacitor

Downloads

2,560

Credits

This project was forked from the Cap ML plugin written by Vennela Kodali. It was refactored and converted to Capacitor 4.

For Capacitor 4 projects use v4.x
For Capacitor 5 projects use v5.x
For Capacitor 6 projects use v6.x

Installation

npm install @capacitor-community/image-to-text

Usage

There is one method detectText that takes a filename of an image and will return the text associated with it.

Add the following to your application:

import { Ocr, TextDetections } from '@capacitor-community/image-to-text';

...

const data: TextDetections = await Ocr.detectText({ filename: '[get-filename-of-image-jpg]' });
for (let detection of data.textDetections) {
    console.log(detection.text);
}

The above code will convert the image file and console.log the text found in it.

Example with Camera

You can use the @capacitor/camera plugin to take a photo and convert it to text:

import { Camera, CameraResultType, CameraSource } from '@capacitor/camera';
import { Ocr, TextDetections } from '@capacitor-community/image-to-text';

...

const photo = await Camera.getPhoto({
  quality: 90,
  allowEditing: true,
  resultType: CameraResultType.Uri,
  source: CameraSource.Camera,
});

const data: TextDetections = await Ocr.detectText({ filename: photo.path });

for (let detection of data.textDetections) {
  console.log(detection.text);
}

A full sample application can be found here.

video of scanning a card and it being converted to text

iOS Usage

No additional setup is required to use this plugin in a iOS Capacitor project.

Android Usage

Your project must include a google-services.json file stored in the Android project folder (usually android/app).

Create Firebase Project and App

Sign in to console.firebase.google.com
Click on Add Project and follow through the steps.
Click the Android icon to create an android app.
Enter the Package Name which must match your apps package name (You can find it in android/app/AndroidManifest.xml).
Click Register App
Download google-services.json and save into your project's android/app directory.

Add Firebase SDK

The sample project has this in place in its build.gradle (see here as a reference).

Note: Most starter Capacitor projects are preconfigured to load google-services.json.

API Reference

detectText(...)

detectText(options: DetectTextFileOptions | DetectTextBase64Options) => Promise<TextDetections>

Detect text in an image

| Param | Type | Description | | ------------- | ----------------------------------------------------------------------------------------------------------------------------------------- | -------------------------- | | options | DetectTextFileOptions | DetectTextBase64Options | Options for text detection |

Returns: Promise<TextDetections>

Interfaces

TextDetections

| Prop | Type | | -------------------- | ---------------------------- | | textDetections | TextDetection[] |

TextDetection

| Prop | Type | | ----------------- | ----------------------------- | | bottomLeft | [number, number] | | bottomRight | [number, number] | | topLeft | [number, number] | | topRight | [number, number] | | text | string |

DetectTextFileOptions

| Prop | Type | | ----------------- | ------------------------------------------------------------- | | filename | string | | orientation | ImageOrientation |

DetectTextBase64Options

| Prop | Type | | ----------------- | ------------------------------------------------------------- | | base64 | string | | orientation | ImageOrientation |

Enums

ImageOrientation

| Members | Value | | ----------- | -------------------- | | Up | 'UP' | | Down | 'DOWN' | | Left | 'LEFT' | | Right | 'RIGHT' |

Compatibility

Images are expected to be in portrait mode only, i.e. with text facing up. It will try to process even otherwise, but note that it might result in gibberish.

iOS and Android are supported. Web is not.

| Feature | ios | android | | -------------------------------- | --------------------------- | -------------------------------------------------------------------------------------------------------------------- | | ML Framework | CoreML Vision | Firebase MLKit | | Text Detection with Still Images | Yes | Yes | | Detects lines of text | Yes | Yes | | Bounding Coordinates for Text | Yes | Yes | | Image Orientation | Yes (Up, Left, Right, Down) | Yes (Up, Left, Right, Down) | | Skewed Text | Yes | Unreliable | | Rotated Text (<~ 45deg) | Yes | Yes (but with noise) | | On-Device | Yes | Yes | | SDK/ios Version | ios 13.0 or newer | Targets API level >= 16Uses Gradle >= 4.1com.android.tools.build:gradle >= v3.2.1compileSdkVersion >= 28 | | | | |

License

Hippocratic License Version 2.0.

For more information, refer to LICENSE file