npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2024 – Pkg Stats / Ryan Hefner

pls-img-txt

v0.0.3

Published

An image to text package with tesseract.

Downloads

8

Readme

Image to Text

MPOP Reverse II

How to install

npm install pls-img-txt

How to use (scan) .scan(imagePath [, ocr_engine_mode] [, pageseg_mode])

The OCR (Optical Character Recognition) Engine Mode is one part of this project from its first release. According to IBM, Optical character recognition (OCR) is sometimes referred to as text recognition. An OCR program extracts and repurposes data from scanned documents, camera images and image-only pdfs. OCR software singles out letters on the image, puts them into words and then puts the words into sentences, thus enabling access to and editing of the original content. It also eliminates the need for manual data entry. The Page segmentation mode defines how your text should be treated by Tesseract. For example, if your image contains a single character or a block of text, you want to specify the corresponding psm so that you can improve accuracy. According to David Sixela. This two are now added as customized options in this project, it is still optional for developers to user with the default value of ocr_engine_mode = 2 and pageseg_mode = 3.

const { scan } = require("pls-img-txt")

let run = async () => {
	let output = await scan("./sampleimg.png")
	// await scan("./sampleimg.png", 2, 3)
	// This is just optional
	console.log(output)
}

run()

Result

{
	"result": "Sample text"
}

How to use (Add language) .addLanguage([language])

This feature is just optional, this package has already default installed languages which are english and the orientation and script detection (osd).

const pls_img_txt = require("pls-img-txt")

let run = async () => {
	pls_img_txt.addLanguage(pls_img_txt.CEBUANO)
	pls_img_txt.addLanguage(pls_img_txt.FILIPINO)
	pls_img_txt.addLanguage(pls_img_txt.TAGALOG)
	let output = await pls_img_txt.scan("./sampleimg.png")
	console.log(output)
}

run()

Add language is still in development, so that this feature might not be stable. Try to add some try catch to handle this kind of error and to avoid some crash on to your system.

Language Lists

  • ARABIC
  • CEBUANO
  • CHINESE_SIMPLIFIED
  • CHINESE_TRADITIONAL
  • GERMAN
  • GREEK
  • FILIPINO
  • HEBREW
  • JAPANESE
  • KOREAN
  • TAGALOG

For more language, kindly visit this link, and use the key language to add.


Credits

  1. Tesseract.js
  2. cli-progress
  3. ansi-colors