sinew-node
v0.1.0
Published
Sinew-Node collects structured data from web sites (screen scraping).
Downloads
4
Readme
Welcome to Sinew-Node
Sinew-Node collects structured data from web sites (screen scraping).
Sinew is distributed as a ruby gem:
npm install sinew-node
Example
Here's an example for collecting Reddit's topic list:
sinew = require '../lib/sinew-node'
sinew.get 'http://www.reddit.com/r/javascript/', ->
(@$ '#siteTable div.thing a.title').each (index)->
console.log @innerHTML if index < 5
Features
- Sinew caches all HTTP requests on disk. That makes it possible to iterate quickly. Crawl once and then continue to work on your recipe. Run the recipe over and over while you tune your CSS selectors and regular expressions.