create-spidergram
v0.9.3
Published
Initialize a new Spidergram based project
Downloads
8
Readme
Build a custom web analysis tool with Spidergram
Spidergram is a toolkit for crawling and analyzing complex web properties. create-spidergram
is a quick and easy way to set up a new Spidergram project of your own.
Usage
- Ensure you're running NodeJS 18 (
node -v
) - Install ArangoDB via direct download or homebrew. Alternately, if you've got Docker installed, you can use Spidergram's included docker-compose.yml file to spin up an Arango container for testing and development.
- Create a new project directory,
cd
into it, and runnpx create-spidergram
. You'll be prompted for the project's name and your choice of project template. - Run
npm install
- Kick the tires with
npm run crawl <url>
, or dive right in to customizing the project.
The Templates
- Boilerplate is a simple script that fires up a Spidergram crawler, grabs the contents of one or more sites, and prints out a summary report of their URL structures.
- Boilerplate (Typescript) is a Typescript version of Boilerplate, with no other functional differences.
- Crawl with Report (Typescript) demonstrates basic data extraction and report generation in plaintext and Excel formats.