page-scrapper

v1.0.1

Published

3 years ago

A simple node.js scrapper that pulls out all links and images of a given site.

Downloads

0High
0Medium
0Low

rocktimsaikia

page-scrapper node-scrapper site-scrapper link-scrapper website-scrapper web-scapper scrapper

page-scrapper

A simple node.js scrapper that pulls out all links and images of a given site. :package:

Installation

npm install page-scrapper

Highlights

Super easy to use
Removes duplicate links/images by default
Filters out the relative paths (configurable)
Tests cases added

Basic Usage

const pageScrapper = require('page-scrapper');

(async() => {
    const data = await pageScrapper('https://jsonplaceholder.typicode.com/');

    console.log(data);
    /* =>
    {
        links: [
            'https://dev.to/typicode/what-s-new-in-husky-5-32g5',
            'https://github.com/sponsors/typicode',
            'https://blog.typicode.com',
            'https://my-json-server.typicode.com',
            'https://github.com/typicode/json-server',
            'https://github.com/typicode/lowdb',
            'https://tryretool.com/?utm_source=sponsor&utm_campaign=typicode',
            'https://mockend.com',
            'https://github.com/users/typicode/sponsorship',
            'https://github.com/typicode'
        ],
        images: [
            'https://i.imgur.com/IBItATn.png',
            'https://mockend.com/banner.svg'
        ]
    }
    */
})();

Options

There are the currently available options

| Option | Required | Default | Description | | :------------- | :----------: | :-----------: | -----------| | absoluteOnly | No | true | Only scraps the absolute links. When set it to false it will fetch the relative paths too.|

Contribute

For any new feature request or bug report, please open an issue or pull request in GitHub.

meta-fecther - Tiny URL meta-data fetcher(scrapper) for Node.js

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

page-scrapper

Installation

Highlights

Basic Usage

Options

Contribute

Related

License