@ai4bharat/indic-transliterate
v1.3.8
Published
Transliterate component for React
Downloads
754
Readme
Transliteration component for React with support for 21 Indic languages. Uses API from AI4Bharat IndicXlit.
Table of contents
- 1. About
- 2. Demo
- 3. Install
- 4. Usage
- 5. Get transliteration suggestions
- 6. Languages
- 7. New Feature: Caching for Frequently Used Words
- 8. License
1. About
This is a frontend library to enable your users to type in many different languages of South Asia, and can be integrated into any React-based application. This library is a fork of react-transliterate, which uses Google Transliterate API which supports around 40 languages across the globe. In this module, our focus is to provide high-quality transliteration-suggestions for Indic languages, especially for low-resource languages like Kashmiri, Manipuri, etc. (which are not supported by Google). For more details about the AI system behind this, please check AI4Bhārat Indic-Xlit.
2. Demo
Source of this demo is available in the example
folder of the repo.
3. Install
npm install --save @ai4bharat/indic-transliterate
OR
yarn add @ai4bharat/indic-transliterate
4. Usage
4.1. Basic example
import React, { useState } from "react";
import { IndicTransliterate } from "@ai4bharat/indic-transliterate";
import "@ai4bharat/indic-transliterate/dist/index.css";
const App = () => {
const [text, setText] = useState("");
return (
<IndicTransliterate
value={text}
onChangeText={(text) => {
setText(text);
}}
lang="hi"
/>
);
};
export default App;
4.2. With custom component
import React, { useState } from "react";
import { IndicTransliterate } from "@ai4bharat/indic-transliterate";
import "@ai4bharat/indic-transliterate/dist/index.css";
const App = () => {
const [text, setText] = useState("");
return (
<IndicTransliterate
renderComponent={(props) => <textarea {...props} />}
value={text}
onChangeText={(text) => {
setText(text);
}}
lang="hi"
/>
);
};
export default App;
4.3. With TypeScript
import React, { useState } from "react";
import { IndicTransliterate, Language } from "@ai4bharat/indic-transliterate";
import "@ai4bharat/indic-transliterate/dist/index.css";
const App = () => {
const [text, setText] = useState("");
const [lang, setLang] = useState<Language>("hi");
return (
<IndicTransliterate
renderComponent={(props) => <textarea {...props} />}
value={text}
onChangeText={(text) => {
setText(text);
}}
lang={lang}
/>
);
};
export default App;
4.4. With Material-UI
import React, { useState } from "react";
import { IndicTransliterate, Language } from "@ai4bharat/indic-transliterate";
import "@ai4bharat/indic-transliterate/dist/index.css";
import Input from "@material-ui/core/Input";
const App = () => {
const [text, setText] = useState("");
const [lang, setLang] = useState<Language>("hi");
return (
<IndicTransliterate
renderComponent={(props) => {
const inputRef = props.ref;
delete props["ref"];
return <Input {...props} inputRef={inputRef} />;
}}
value={text}
onChangeText={(text) => {
setText(text);
}}
lang={lang}
/>
);
};
export default App;
4.5. With External API for transliteration
import { IndicTransliterate } from "@ai4bharat/indic-transliterate";
import "@ai4bharat/indic-transliterate/dist/index.css";
const App = () => {
const [text, setText] = useState("");
return (
<IndicTransliterate
value={text}
onChangeText={(text) => {
setText(text);
}}
lang="hi"
customApiURL="https://your-custom-transliteration-api/{language}/{text}"
/>
);
};
export default App;
This allows users to integrate their own APIs instead of relying on the default transliteration API, as long as the external API is a GET
API with the following format: https://your-custom-transliteration-api/{language}/{text}
and returns a response in this format:
{
"result": [
"suggestion-1",
"suggestion-2",
"suggestion-3",
"suggestion-4",
"suggestion-5",
...
],
...
}
5. Get transliteration suggestions
5.1. Standalone example
import { getTransliterateSuggestions } from "@ai4bharat/indic-transliterate";
const data = await getTransliterateSuggestions(
word, // word to fetch suggestions for
{
numOptions: 5, // number of suggestions to fetch
showCurrentWordAsLastSuggestion: true, // add the word as the last suggestion
lang: "hi", // target language
},
);
5.2. Custom trigger keys
Keys which when pressed, input the current selection to the textbox
Indic Transliterate uses the event.keycode
property to detect keys. Here are some predefined keys you can use. Or, you can enter the integer codes for any other key you'd like to use as the trigger
import React, { useState } from "react";
import { IndicTransliterate, TriggerKeys } from "@ai4bharat/indic-transliterate";
import "@ai4bharat/indic-transliterate/dist/index.css";
import Input from "@material-ui/core/Input";
const App = () => {
const [text, setText] = useState("");
return (
<IndicTransliterate
value={text}
onChangeText={(text) => {
setText(text);
}}
lang="hi"
triggerKeys={[
TriggerKeys.KEY_RETURN,
TriggerKeys.KEY_ENTER,
TriggerKeys.KEY_SPACE,
TriggerKeys.KEY_TAB,
]}
/>
);
};
export default App;
5.3. Props
| Prop | Required? | Default | Description |
| -------------------------------- | --------- | ------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------ |
| onChangeText | Yes | | Listener for the current value from the component. (text: string) => void
|
| value | Yes | | value
prop to pass to the component |
| enabled | | true | Control whether suggestions should be shown |
| renderComponent | | (props) => <input {...props} />
| Component to render. You can pass components from your component library as this prop |
| lang | | hi | Language you want to transliterate. See the following section for language codes |
| maxOptions | | 5 | Maximum number of suggestions to show in helper |
| offsetY | | 0 | Extra space between the top of the helper and bottom of the caret |
| offsetX | | 0 | Extra space between the caret and left of the helper |
| containerClassName | | empty string | Classname passed to the container of the component |
| containerStyles | | {} | CSS styles object passed to the container |
| activeItemStyles | | {} | CSS styles object passed to the active item <li>
tag |
| hideSuggestionBoxOnMobileDevices | | false
| Should the suggestions be visible on mobile devices since keyboards like Gboard and Swiftkey support typing in multiple languages |
| hideSuggestionBoxBreakpoint | | 450 | type: number
. To be used when hideSuggestionBoxOnMobileDevices
is true. Suggestion box will not be shown below this device width |
| triggerKeys | | KEY_SPACE, KEY_ENTER, KEY_TAB, KEY_RETURN
| Keys which when pressed, input the current selection to the textbox |
| insertCurrentSelectionOnBlur | | true
| Should the current selection be inserted when blur
event occurs |
| showCurrentWordAsLastSuggestion | | true
| Show current input as the last option in the suggestion box |
| horizontalView | | false
| When true
suggestions appear side by side, improving readability and accessibility for users who prefer a compact layout |
| customApiURL | | indic transliterate api | Flexibility to use external API for transliteration, seamless integration as long as the response structure follows the required format 4.5 |
6. Languages
6.1. Get supported languages
import { getTransliterationLanguages } from "@ai4bharat/indic-transliterate";
const data = await getTransliterationLanguages();
6.2. List of language codes
Currently supports the following 21 languages from the Indian subcontinent:
|ISO 639 code | Language | |---|--------------------| |as |Assamese - অসমীয়া | |bn |Bangla - বাংলা | |brx|Boro - बड़ो | |gu |Gujarati - ગુજરાતી | |hi |Hindi - हिंदी | |kn |Kannada - ಕನ್ನಡ | |ks |Kashmiri - كٲشُر | |gom|Konkani Goan - कोंकणी| |mai|Maithili - मैथिली | |ml |Malayalam - മലയാളം| |mni|Manipuri - ꯃꯤꯇꯩꯂꯣꯟ | |mr |Marathi - मराठी | |ne |Nepali - नेपाली | |or |Oriya - ଓଡ଼ିଆ | |pa |Panjabi - ਪੰਜਾਬੀ | |sa |Sanskrit - संस्कृतम् | |sd |Sindhi - سنڌي | |si |Sinhala - සිංහල | |ta |Tamil - தமிழ் | |te |Telugu - తెలుగు | |ur |Urdu - اُردُو |
7. New Feature: Caching for Frequently Used Words
Overview:
The module now supports caching of up to 10,000 words per language to enhance performance by reducing API calls. The caching mechanism stores words and their corresponding suggestions, improving the speed and efficiency of generating suggestions, especially for frequently used words.
How It Works:
Cache Initialization:
When a user begins typing, the tool fetches suggestions for each word. If a word is used for the first time, it is added to the cache with its suggestions and an initial frequency count of1
.Cache Structure:
The cache is structured by language, where each language has its own dictionary of words and their suggestions:{ language-1: { word-1: { suggestions: string[], frequency: number }, word-2: { suggestions: string[], frequency: number }, word-3: { suggestions: string[], frequency: number }, ... }, language-2:{ ... }, ... }
Cache Lookup:
- When a word is typed, the tool first searches for it in the cache.
- If found, the suggestions are retrieved, and the frequency count is incremented.
- If not found, the tool fetches suggestions from the API, adds the word to the cache, and initializes the frequency count to 1.
Cache Management:
- The cache is limited to 10,000 words per language.
- When the cache for a specific language exceeds 10,000 words, the least frequently used word is removed to make room for new entries.
Example Workflow:
- User starts typing a word (e.g., "hello"):
- The tool checks the cache for the word.
- If "hello" is found, the suggestions are retrieved, and the frequency is incremented.
- If "hello" is not found, suggestions are fetched from the API, and "hello" is added to the cache with a frequency of 1.
- Cache size exceeds 10,000 words:
- The least frequently used word is replaced with the new word and its suggestions.
- User starts typing a word (e.g., "hello"):
Key Benefits:
- Improved Performance: Reduces the number of API calls by utilizing cached suggestions for frequently used words.
- Adaptive Caching: Automatically adjusts to user behavior, increasing the frequency count for commonly used words.
- Efficient Storage: Least frequently used words are replaced to maintain an optimal cache size.
8. License
MIT © ai4bharat
Sincere thanks to burhanuday for making his work open-source!