Arachnid
Arachnid is a Java-based web spider framework. It includes a simple HTML parser object that parses an input stream containing HTML content. Simple web spiders can be created by subclassing Arachnid and adding a few lines of code that are called after each page of a website is parsed.
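The subclass-and-override pattern described above can be illustrated with a minimal, self-contained sketch. It does not use Arachnid's actual classes or method names: SketchSpider, handlePage, extractLinks, and LinkCountingSpider are hypothetical names invented for this example, the "site" is an in-memory map rather than a live website, and a regular expression stands in for a real HTML parser.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical base class for illustration only; this is NOT Arachnid's real API. */
abstract class SketchSpider {
    // Naive matcher for double-quoted href attributes (a stand-in for a real HTML parser).
    private static final Pattern HREF =
            Pattern.compile("href\\s*=\\s*\"([^\"]*)\"", Pattern.CASE_INSENSITIVE);

    /** Hook that subclasses override; called once after each page is parsed. */
    protected abstract void handlePage(String url, List<String> links);

    /** Returns the href values found in the given HTML text, in document order. */
    static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1));
        }
        return links;
    }

    /** Breadth-first traversal of an in-memory site (URL mapped to HTML text). */
    public void crawl(Map<String, String> site, String start) {
        Set<String> seen = new HashSet<>();
        Deque<String> queue = new ArrayDeque<>();
        seen.add(start);
        queue.add(start);
        while (!queue.isEmpty()) {
            String url = queue.remove();
            String html = site.get(url);
            if (html == null) {
                continue; // page not present in the in-memory site: skip it
            }
            List<String> links = extractLinks(html);
            handlePage(url, links); // subclass code runs here, after parsing
            for (String link : links) {
                if (seen.add(link)) { // enqueue each URL only once
                    queue.add(link);
                }
            }
        }
    }
}

/** Example subclass: records how many links each visited page contains. */
class LinkCountingSpider extends SketchSpider {
    final Map<String, Integer> linkCounts = new LinkedHashMap<>();

    @Override
    protected void handlePage(String url, List<String> links) {
        linkCounts.put(url, links.size());
    }
}
```

In this sketch the traversal logic lives in the base class, so a new spider only has to supply the per-page hook. For the framework's real class and method names, consult the documentation on the Arachnid homepage.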
Metadata
Category: Crawlers
License: GNU General Public License (GPL)
Homepage: http://arachnid.sourceforge.net/