Tags: StormCrawler


In this preview of his upcoming talk at ApacheCon, Julien Nioche explains how StormCrawler can be used to build a distributed web crawler.

StormCrawler: An Open Source SDK for Building Web Crawlers with ApacheStorm

StormCrawler is an open source collection of reusable resources, mostly implemented in Java, for building low-latency, scalable web crawlers on Apache Storm. In his upcoming talk at ApacheCon, Julien Nioche, Director of DigitalPebble Ltd, will compare StormCrawler with similar projects, such as...
Read 0 Comments
Click Here!