cebuano-stemmer
v0.1.2
A stemmer for the Cebuano language spoken in the Philippines
Cebuano dictionary and stemmer
A dictionary and stemmer for the Cebuano language spoken in the Philippines.
Written in TypeScript and compiled to an ES5 CommonJS module (for use in Node.js) and a single-file ES5 UMD module (for use in the browser).
Based on J. Hellingman's Cebuano stemmer for Java and dictionary for Android, which is in turn based on J. Wolff's seminal work A Dictionary of Cebuano Visayan.
Dictionary
The dictionary data has been converted from a SQLite database to a publicly readable CouchDB instance hosted at https://publicdomainreview.cloudant.com/cebuano_dictionary.
The primary index for an entry is the normalized root form of the Cebuano word plus an identification number.
You can use the regular CouchDB API to query. For instance, to get the first 20 entries starting with "k":
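A minimal sketch of such a query, assuming the standard CouchDB `_all_docs` endpoint with its `startkey`, `endkey`, and `limit` parameters. The helper name `entriesWithPrefixUrl` is hypothetical and not part of this package, and the `\ufff0` upper bound is the usual CouchDB convention for prefix ranges rather than something this README specifies.

```typescript
// Base URL of the dictionary database, as given above.
const DB_URL = "https://publicdomainreview.cloudant.com/cebuano_dictionary";

// Hypothetical helper (not part of this package): builds an _all_docs URL
// for up to `limit` entries whose IDs start with `prefix`.
function entriesWithPrefixUrl(prefix: string, limit: number): string {
  const params = new URLSearchParams({
    // CouchDB expects key parameters to be JSON-encoded, hence JSON.stringify.
    startkey: JSON.stringify(prefix),
    // "\ufff0" is a high code point conventionally used to close a prefix range.
    endkey: JSON.stringify(prefix + "\ufff0"),
    limit: String(limit),
  });
  return `${DB_URL}/_all_docs?${params.toString()}`;
}

// URL for the first 20 entries starting with "k".
const firstTwentyK = entriesWithPrefixUrl("k", 20);
console.log(firstTwentyK);
```

The resulting URL can be opened in a browser or passed to any HTTP client. Adding `include_docs=true` to the parameters makes `_all_docs` return the full documents rather than only their IDs.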
Search
The database provides full-text search for English and Cebuano. You can query using the Lucene Query Parser Syntax.
Examples:
| Description | URL |
|---|---|
| English full text search through entries for "fire" | https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromEnglish?q=fire |
| English synonym search for "fire" | https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromEnglish?q=synonym:fire |
| Cebuano full text search through entries for "abat" | https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromCebuano?q=abat |
| Cebuano synonym search for "abat" | https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromCebuano?q=synonym:abat |
To get more information for each result, add the query parameter `include_docs=true`.
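The search URLs above all follow the same pattern, which can be sketched as a small URL builder. The helper name `searchUrl` and the `SearchIndex` type are hypothetical and not part of this package. The index names, the `q` parameter, and `include_docs` are taken from the examples above.

```typescript
// Search endpoint prefix, taken from the example URLs above.
const SEARCH_BASE =
  "https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search";

// The two search indexes that appear in the examples.
type SearchIndex = "fromEnglish" | "fromCebuano";

// Hypothetical helper: builds a search URL for a Lucene query string.
function searchUrl(index: SearchIndex, query: string, includeDocs = false): string {
  // URLSearchParams percent-encodes the query, so "synonym:fire" becomes "synonym%3Afire".
  const params = new URLSearchParams({ q: query });
  if (includeDocs) {
    params.set("include_docs", "true");
  }
  return `${SEARCH_BASE}/${index}?${params.toString()}`;
}

const englishFire = searchUrl("fromEnglish", "fire");
const cebuanoSynonym = searchUrl("fromCebuano", "synonym:abat", true);
console.log(englishFire);
console.log(cebuanoSynonym);
```

Percent-encoding the colon is equivalent to writing it literally, so the generated URLs address the same searches as the ones in the table.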
Stemmer
A JSON list of word roots used in the stemmer is available at https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/show/_list/keyset/normalized_heads_where_pos_is_not_empty_string?group_level=1
A JSON list of all heads is available at https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/show/_list/keyset/heads?group_level=1
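Once one of these lists has been downloaded and parsed, it can be loaded into a set for fast membership checks. The sketch below assumes the response parses to a flat JSON array of strings, which this README does not confirm. The helper name `toRootSet` is hypothetical and is not the stemmer's own API.

```typescript
// Hypothetical helper: turns parsed JSON into a set of root strings.
// Assumes the payload is a flat array; non-string items are skipped.
function toRootSet(data: unknown): Set<string> {
  if (!Array.isArray(data)) {
    throw new TypeError("expected a JSON array of word roots");
  }
  const roots = new Set<string>();
  for (const item of data) {
    if (typeof item === "string") {
      roots.add(item);
    }
  }
  return roots;
}

// Illustrative input only; real data would come from one of the URLs above.
const sampleRoots = toRootSet(JSON.parse('["abat", "abat", 3]'));
console.log(sampleRoots.has("abat"));
```

If the actual payload has a different shape, only the validation inside `toRootSet` would need to change.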