flay-js
v0.1.1
Published
Extract structured content from any HTML website
Downloads
5
Maintainers
Readme
flay.js
A pure JS implementation of the Rapid Automated Keyword Extraction (RAKE) algorithm. Put in any text corpus, get back a bunch of keyphrases and keywords.
How to use
import { feed, website } from 'flay-js'
const websiteData = website(someHtmlString)
// or
const feedData = feed(someXmlString)
Implementation Details
This algorithm can parse both, websites and RSS/Atom feeds, and return structured data as good as possible.
ToDo
- more exhaustive documentation
- proper date handling
- browserify
LICENSE:
LGPL-3.0.
You can use this package in all your free or commercial products without any issues, but I want bugfixes and improvements to this algorithm to flow back into the public code repository.