webl-scraper

v1.0.1

Published

2 years ago

Simple and fast web scraper.

Downloads

0High
0Medium
0Low

andreagennaioli

http web scraper promise

webl-scraper

A fast and easy-to-use package for manipulating online page content.

Install

Install our package using npm:

npm install webl-scraper

How To Use

Code example:

/* Import */
const { Selector, Scraper } = require("webl-scraper");

/* Create the scraper */
const scraper = new Scraper("https://en.wikipedia.org/wiki/Rome",
  /* List of selectors */
  [
    /* This selector get all the 'a' tags and reads the href attribute value */
    new Selector(
      "a",
      "href"
    ),
    /* This selector gets the 'innerHTML' propriety of the selected elements */
    new Selector(
      "h2 > span.mw-headline",
      "__innerHTML" // Using HTML element properties
    ),
    new Selector(
      "h3 > span.mw-headline",
      "__innerHTML"
    ),
  ]
);

/* Start the scraper. Returns a Promise */
scraper.scrape().then(r => {
	console.log(r)
})

Output:

[
  {
    selector: 'a',
    attr: 'href',
    values: [
      '/wiki/Latium',
      '/wiki/Tiber',
      '/wiki/Vatican_City',
      ...
    ]
  },
  {
    selector: 'h2 > span.mw-headline',
    attr: '__innerHTML',
    values: [
      'Etymology',
      'History',
      'Government',
      ...
    ]
  },
  {
    selector: 'h3 > span.mw-headline',
    attr: '__innerHTML',
    values: [
      'Earliest history',
      'Monarchy and republic',
      'Empire',
      ...
    ]
  }
]

Pkg
Stats

Discover Tips

General search

Package details

User packages

Sponsor

About

Twitter

GitHub

Twitter

GitHub

Site

Open Software & Tools

Framework

Server

Data Store

Caching

CSS / Styling

Typeface

Avatars

Data Viz

Date formatting

Infinite scrolling

Markdown rendering

Repository url parsing

User data

Compiling

Types

Odds & Ends

webl-scraper

v1.0.1

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

webl-scraper

Install

How To Use