n8n-nodes-webpage-content-extractor
v0.1.2
This is an n8n community node that extracts the main content of a web page. Like the 'Reader' mode in your browser, it ignores headers, footers, banners, and similar page clutter.
It's based on the Readability library that is used by Firefox's Reader View.
Contents: Installation, Compatibility, Usage, Resources, Version history
Installation
Follow the installation guide in the n8n community nodes documentation.
Compatibility
Tested on n8n v1.20.0 and above.
Usage
To use this node, fetch a web page's HTML with an "HTTP Request" node and pass the result to WebpageContentExtractor.
Because the request is made by a separate node, you can configure every aspect of it (method, authentication, headers, etc.).
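The two-step flow described above can be sketched outside n8n in plain TypeScript. This is only an illustration under stated assumptions: `RequestConfig`, `buildRequest`, and `extractReadable` are hypothetical names that are not part of this package, and the tag-stripping heuristic is a toy stand-in for the Readability library that the node is based on.

```typescript
// Step 1 (the "HTTP Request" node's job): describe the web request.
// RequestConfig and buildRequest are hypothetical, for illustration only.
interface RequestConfig {
  url: string;
  method: "GET" | "POST";
  headers: Record<string, string>;
}

function buildRequest(url: string, token?: string): RequestConfig {
  const headers: Record<string, string> = { Accept: "text/html" };
  if (token !== undefined) {
    // Authentication is configured on the request, not on the extractor.
    headers["Authorization"] = `Bearer ${token}`;
  }
  return { url, method: "GET", headers };
}

// Step 2 (the extractor's job): reduce raw HTML to readable text.
// Toy heuristic, NOT the Readability algorithm: drop common page-chrome
// elements, strip the remaining tags, then collapse whitespace.
function extractReadable(html: string): string {
  const chrome =
    /<(header|footer|nav|aside|script|style)\b[^>]*>[\s\S]*?<\/\1>/gi;
  return html
    .replace(chrome, " ")
    .replace(/<[^>]+>/g, " ")
    .replace(/\s+/g, " ")
    .trim();
}

// Example input: the HTML that step 1 would have fetched.
const sampleHtml =
  "<html><body><header>Site</header>" +
  "<article><h1>Title</h1><p>Hello world.</p></article>" +
  "<footer>(c) 2024</footer></body></html>";

const request = buildRequest("https://example.com/post", "secret");
const readable = extractReadable(sampleHtml);
```

Keeping the request description separate from the extraction step mirrors the two-node design: the extractor only ever sees HTML, so any change to how the page is fetched leaves it untouched.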
Resources
Version history
0.1.2 (2024-11-20)
- Add "content" attribute to output
0.1.1
- Fix the README
0.1.0
- Initial release