react-native-japanese-tokenizer
v1.0.6
Published
Async Japanese Tokenizer Native Plugin for React Native for iOS and Android
Downloads
35
Readme
react-native-japanese-tokenizer
Async Japanese tokenizer native plugin for React Native that works on iOS and Android. It tokenizes Japanese text in background thread so it won't block UI. It comes in handy for such as indexing a lot of texts for full-text search.
How it works
iOS
It utilizes CFStringTokenizer
to tokenize strings into words.
Android
The implementation is based on TinySegmenter. It is lightweight word segmenter which works without dictionaries.
Getting started
$ npm install react-native-japanese-tokenizer --save
Mostly automatic installation
$ react-native link react-native-japanese-tokenizer
Manual installation
iOS
- In XCode, in the project navigator, right click
Libraries
➜Add Files to [your project's name]
- Go to
node_modules
➜react-native-japanese-tokenizer
and addRNJapaneseTokenizer.xcodeproj
- In Xcode, in the project navigator, select your project. Add
libRNJapaneseTokenizer.a
to your project'sBuild Phases
➜Link Binary With Libraries
- Run your project (
Cmd+R
)
Android
- Open up
android/app/src/main/java/[...]/MainActivity.java
- Add
import dog.craftz.japanese_tokenizer.RNJapaneseTokenizerPackage;
to the imports at the top of the file - Add
new RNJapaneseTokenizerPackage()
to the list returned by thegetPackages()
method
- Append the following lines to
android/settings.gradle
:include ':react-native-japanese-tokenizer' project(':react-native-japanese-tokenizer').projectDir = new File(rootProject.projectDir, '../node_modules/react-native-japanese-tokenizer/android')
- Insert the following lines inside the dependencies block in
android/app/build.gradle
:compile project(':react-native-japanese-tokenizer')
Usage
Using Promise
:
import Tokenizer from 'react-native-japanese-tokenizer'
var text = "週休七日で働きたい"
Tokenizer.tokenize(text).then(tokens => {
...
})
async/await
:
import Tokenizer from 'react-native-japanese-tokenizer'
var text = "週休七日で働きたい"
const tokens = await Tokenizer.tokenize(text)
Also check out the test to learn more.
License
MIT License. By Takuya Matsuyama (@craftzdog).
Note that this module is based on following libraries with respective license. Thank you for the fantastic works!
- TinySegmenter: Javascriptだけで実装されたコンパクトな分かち書きソフトウェア
- BSD-3-Clause
- TinySegmenter for Java by takscape
- Public Domain