wikiminer

v1.0.1

Published

2 years ago

A wrapper for the Wikipedia API that provides data mining functionality.

Vulnerabilities: 0 High, 0 Medium, 0 Low

Maintainers: galencorey, lisaveras

Keywords: Wikipedia, data mining, stop words, machine learning, word frequency

##Wikiminer

###About

Wikiminer is a wrapper for the Wikipedia API that provides data mining functionality. Given the title of a Wikipedia article, Wikiminer returns a promise that resolves with an object containing:

- the text of that article
- the text of that article stripped of stop words
- an array containing the most common words in the article and their occurrences
- an array containing the least common words in the article and their occurrences

When finding most and least common words, case is ignored and singular and plural versions of words are counted together.
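
The counting rule described above can be illustrated with a minimal sketch. This is not Wikiminer's actual implementation: `countWords` is a hypothetical helper, and the plural handling (stripping a single trailing "s") is a deliberately naive assumption that will mishandle words such as "this" or irregular plurals.

```javascript
// Hypothetical helper, not part of wikiminer's API: counts the words in a
// string, ignoring case and folding simple "-s" plurals into the singular.
function countWords(text) {
  const counts = new Map();
  // Lowercase first so "Dog" and "DOG" are counted as the same word.
  const words = text.toLowerCase().match(/[a-z]+/g) || [];
  for (const word of words) {
    // Naive singularization (assumption): strip one trailing "s" from words
    // longer than three letters, unless they end in "ss" (e.g. "grass").
    const key =
      word.length > 3 && word.endsWith('s') && !word.endsWith('ss')
        ? word.slice(0, -1)
        : word;
    counts.set(key, (counts.get(key) || 0) + 1);
  }
  // Convert to { word, count } objects sorted by descending count.
  return [...counts.entries()]
    .map(([word, count]) => ({ word, count }))
    .sort((a, b) => b.count - a.count);
}

console.log(countWords('Dog dogs DOG wolf'));
```

Reversing the sorted array (or sorting ascending) would give a least-common list in the same shape.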

###Usage

npm install wikiminer

const wikiminer = require('wikiminer');
wikiminer(title, topCount, leastCount); // returns a promise

- title: String. Must be a valid title of a Wikipedia page.
- topCount: Number. How many words to include in the most common words list. Default: 20
- leastCount: Number. How many words to include in the least common words list. Default: 20

The promise resolves with an object of the following structure: { text: '', textMinusStop: '', topWords: [], leastWords: [] }

###Example

The following example

const wikiminer = require('wikiminer');

// Fetch the "dog" article and request the 5 most and 5 least common words
wikiminer('dog', 5, 5)
  .then(function (result) {
    console.log(result);
  });

Will print to the console:

{
  text: /*the text of the dog article*/,
  textMinusStop: /*the text of the dog article without stop words */ ,
  topWords:[ { word: 'dog', count: 345 },
     { word: 'human', count: 60 },
     { word: 'wolf', count: 48 },
     { word: 'breed', count: 47 },
     { word: 'pet', count: 40 } ],
  leastWords: [ { word: 'inadequate', count: 1 },
     { word: 'taxa', count: 1 },
     { word: 'colour', count: 1 },
     { word: 'sobriquet', count: 1 },
     { word: 'etymology', count: 1 } ]
}
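
Because `topWords` and `leastWords` are arrays of `{ word, count }` objects, it can be convenient to turn them into a lookup keyed by word. The sketch below uses the sample values from the output above (with the text fields omitted); `toLookup` is a hypothetical helper and not part of Wikiminer.

```javascript
// Sample data copied from the example output above; text fields omitted.
const result = {
  topWords: [
    { word: 'dog', count: 345 },
    { word: 'human', count: 60 },
    { word: 'wolf', count: 48 },
    { word: 'breed', count: 47 },
    { word: 'pet', count: 40 },
  ],
  leastWords: [
    { word: 'inadequate', count: 1 },
    { word: 'taxa', count: 1 },
    { word: 'colour', count: 1 },
    { word: 'sobriquet', count: 1 },
    { word: 'etymology', count: 1 },
  ],
};

// Hypothetical helper: converts [{ word, count }, ...] into { word: count }.
function toLookup(entries) {
  return Object.fromEntries(entries.map(({ word, count }) => [word, count]));
}

const top = toLookup(result.topWords);
console.log(top.dog); // 345
```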
