HotSax
HotSAX is a fast, small footprint, non-validating SAX2 parser for HTML/XML/XHTML. It can be used in simple web agents, page scrapers, and spiders. It is similar to the Apache Xerces parser, except that it can generate SAX events for badly formatted HTML as well.
Metadata
Category: HTML Parsers
License: GNU Library or Lesser General Public License (LGPL)
Homepage: http://hotsax.sourceforge.net/
Sponsored Ad