HN,
So which do you guys use more, Import.io or Kimono? I have heard good things about both.
I prefer code that doesn't depend on an API that could vanish the next day or cost a bucket to run.
What do you use for scraping? I may have a scraping project later this year and would love recommendations.
- Fetchbot: https://github.com/PuerkitoBio/fetchbot
Flexible, similar API to net/http (uses a Handler interface with a simple mux provided, supports middleware, etc.)
- gocrawl: https://github.com/PuerkitoBio/gocrawl
Higher-level, more framework than library.
Coupled with goquery (https://github.com/PuerkitoBio/goquery) to scrape the DOM (well, the net/html nodes), this makes custom scrapers trivial to write.
(sorry for the self-promoting comment, but this is quite on topic)
edit: polite crawlers, not scrapers.