eaze-sitemap-generator
v2.0.1
Published
Combines blog slugs with a web crawler to build an xml sitemap
Downloads
1
Readme
Site Map
This script will use sitemap-generator-cli to scrape the website for urls and generates an xml sitemap from those urls. It will then combine the blog slugs found here and a base-url to generate a site map with all (non-product) links. There are two boolean flags to control whether to test the url for a 200 response (one for existing sitemap urls, and one for the blog slugs).
Variables
- BLOGS_BASE_PATH (string) - Sets base url for blog (default: https://cms.ore0.eaze.tech)
- SCRAPE_WEBSITE (bool) - Sets whether to scrape website (default: true)
- CHECK_BLOG_URLS (bool) - Sets whether to test that the blog slugs + base url resolve (default: false)
- CHECK_SCRAPED_URLS (bool) - Sets whether to check the links within the existing sitemap.xml (default: false)
- DEBUG (bool) - Sets whether to display debug information (default: false)
Installation
git clone [email protected]:eaze/site-map.git
cd site-map
npm install
Usage
npm run web-start
CI/CD
A CircleCI script runs on any push to master and at 9am UTC (1am PST) to generate a new sitemap.xml
file and push it to an s3 bucket.