textractor-js
v0.1.6
Published
<h1 align="center">textractor-js ๐๐ธ๏ธ</h1>
Downloads
9
Readme
A simple text extractor for various files compiled to WebAssembly. This is intended to be used directly in the browser to quickly and easily extract text from various file formats.
If you are looking for the main textractor
project, you can find it here.
Installation
You can use the textractor
package in your own project by installing from npm:
npm install textractor-js
This is very much a work in progress. I am working to prioritize adding PPTX and XLSX support, as well as improving the text extraction for PDFs.
Supported formats
- [X] Text (txt)
- [X] PDF
- [X] Word (docx)
- [ ] Excel (xlsx)
- [ ] PowerPoint (pptx)
- [ ] Images (png, jpg, etc)