WebSPHINX
WebSPHINX (Website-Specific Processors for HTML INformation eXtraction) is a Java class library and interactive development environment for Web crawlers, programs that browse and process Web pages automatically.
Metadata
Category: Crawlers
License: Apache Software License
Homepage: http://www-2.cs.cmu.edu/~rcm/websphinx/