reduce-umls-terms
v0.0.5
Remove duplicate terms from a list
Reduce UMLS Terms
In UMLS, each concept unique identifier (CUI) maps to many terms, including translations and spelling variations. Many of these variants differ only in capitalization or in added symbols (such as a semicolon).
This module removes such duplicate entries from a list of UMLS terms.
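To illustrate the kind of normalization that makes such variants comparable, here is a minimal sketch. It is not the module's actual implementation: `normalizeTerm` and `uniqueNormalized` are hypothetical helpers, and the rules (lowercasing, stripping diacritics, replacing symbols with spaces) are assumptions inferred from the example output further down.

```javascript
// Hypothetical helpers for illustration only; not part of reduce-umls-terms.

// Bring a term into a comparable form.
// Assumption: terms use Latin letters; other scripts would be blanked out.
function normalizeTerm(term) {
  return term
    .normalize('NFD')                 // split accented letters into base letter + combining mark
    .replace(/[\u0300-\u036f]/g, '')  // drop the combining marks ("ï" becomes "i")
    .toLowerCase()
    .replace(/[^a-z0-9\s]/g, ' ')     // turn symbols such as "," and ";" into spaces
    .replace(/\s+/g, ' ')             // collapse runs of whitespace
    .trim();
}

// Normalize every term and keep the first occurrence of each result.
function uniqueNormalized(terms) {
  return Array.from(new Set(terms.map(normalizeTerm)));
}

console.log(uniqueNormalized([
  'Reumatoïde artritis',
  'reumatoïde artritis',
  'Artritis, reumatoïde',
  'RA'
]));
// [ 'reumatoide artritis', 'artritis reumatoide', 'ra' ]
```

A `Set` preserves insertion order, so the first spelling of each normalized term determines its position in the result.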
Installation
Use npm:
$ npm install reduce-umls-terms
How to
var reducer = require('reduce-umls-terms');
var entry = [
  "Reumatoïde artritis",
  "Artritis, reumatoïde",
  "artritis; atrofisch",
  "artritis; reumatoïd",
  "atrofisch; artritis",
  "reumatoïd; artritis",
  "Reumatoïde artritis, niet gespecificeerd",
  "RA",
  "atrofische artritis",
  "reumatische artritis",
  "reumatoïde artritis",
  "reumatoïde artritis",
  "reumatoïde artritis",
  "reumatoïde artritis",
  "systemische reumatoïde artritis"
];
var terms = reducer.reduce(entry);
// Judging by the output below, this drops terms that contain another term from the list
terms = reducer.remove_container_duplicates(terms);
/* Gives
[
'reumatoide artritis',
'ra',
'atrofische artritis',
'reumatische artritis',
'artritis reumatoide',
'artritis atrofisch',
'artritis reumatoid',
'atrofisch artritis',
'reumatoid artritis'
]
*/
// The list of terms can be reduced further
// if word order does not matter (say "reumatoid artritis" == "artritis reumatoid")
terms = reducer.remove_order_duplicates(terms);
/* Now gives
[
'ra',
'reumatoide artritis',
'atrofische artritis',
'reumatische artritis',
'artritis atrofisch',
'artritis reumatoid'
]
*/
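The idea behind order-insensitive deduplication can be sketched as follows. This is an illustration under assumptions, not the module's code: `wordKey` and `dropOrderDuplicates` are hypothetical names, terms are assumed to be already normalized and space-separated, and the first occurrence of each word set is kept, so the result order may differ from what `remove_order_duplicates` returns.

```javascript
// Hypothetical helpers for illustration only; not part of reduce-umls-terms.

// Build a key that is identical for terms made of the same words in any order.
function wordKey(term) {
  return term.split(' ').sort().join(' ');
}

// Keep only the first term seen for each word key.
function dropOrderDuplicates(terms) {
  const seen = new Set();
  return terms.filter(function (term) {
    const key = wordKey(term);
    if (seen.has(key)) {
      return false; // same words as an earlier term, only reordered
    }
    seen.add(key);
    return true;
  });
}

console.log(dropOrderDuplicates([
  'reumatoide artritis',
  'ra',
  'artritis reumatoide',
  'artritis atrofisch',
  'atrofisch artritis'
]));
// [ 'reumatoide artritis', 'ra', 'artritis atrofisch' ]
```

Sorting the words gives each word set a single canonical key, so the check needs only one pass over the list.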