@pexxi/fold-to-ascii-ts
v5.1.1
Published
A JavaScript port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into a ASCII equivalents, if they exist.
Downloads
17,465
Maintainers
Readme
fold-to-ascii-ts
A JavaScript port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents.
Documentation
Installation
Using npm:
npm install --save @pexxi/fold-to-ascii-ts
Using yarn:
yarn add @pexxi/fold-to-ascii-ts
Usage
There are two different modes of operation:
- Replace all known non-ASCII characters with appropriate replacements, replace the unknown ones with a fallback (
foldReplacing
). - Replace all known non-ASCII characters with appropriate replacements, maintain the unknown ones (
foldMaintaining
).
The difference in output only manifests if the inputs contain characters without known replacements:
import ASCIIFolder from "fold-to-ascii-ts");
// Some Characters have no defined replacement.
// Specify a fixed replacement character (defaults to the empty string).
ASCIIFolder.foldReplacing("Lörem 🤧 ëripuît") === "Lorem eripuit";
ASCIIFolder.foldReplacing("Lörem 🤧 ëripuît", "X") === "Lorem XX eripuit";
ASCIIFolder.foldMaintaining("Lörem 🤧 ëripuît") === "Lorem 🤧 eripuit";
Tests
npm test
Sources
This is a fork for adding TypeScript types, originally from https://github.com/mplatt/fold-to-ascii .
This is a straightforward port of the extensive switch/case statement found in http://svn.apache.org/repos/asf/lucene/java/tags/lucene_solr_4_5_1/lucene/analysis/common/src/java/org/apache/lucene/analysis/miscellaneous/ASCIIFoldingFilter.java