bbc-reader
v1.0.9
Published
Reads a BBC article from BBC.com
Downloads
3
Readme
bbc-reader
Scrape a BBC article from BBC.com
Install
npm install bbc-reader --save
Use
var BBCReader = require('bbc-reader');
var bbcreader = new BBCReader();
// Promise
bbcreader.read('http://www.bbc.com/news/world-europe-34602621').then(function(article) {
// Do Something with Article
});
// Callback
bbcreader.read('http://www.bbc.com/news/world-europe-34602621', function(article) {
// Do Something with Article
});
Article
var Article = {
title: '',
datetime: '',
body: {
clean: '',
markdown: ''
},
images: [
{
full: ''
}
],
source: ''
};
title The title of the Article. What appears in the h1 on the page.
datetime
The datetime with timezone of the last update of the article. Format: YY-mm-dd H:i:s GMT
. The datetime will always be GMT+0000
.
body The body of the article. Comes in two formats. clean and minimal. The clean format removes all html elements and separates paragraphs by two newlines. Markdown attempts to provide a markdown version of the article.
images
An array of image urls found in the body. Comes in sizes full
for each image.
source The url of the bbc article.