WebSPHINX

Inactive

Java class library and Crawler Workbench for building web crawlers. Multithreaded retrieval, page/link model, robot exclusion, pattern matching; CMU research project (Apache-style license).

Metadata
Category: Crawlers
License:Apache-2.0
Sponsored Ad