Package org.apache.lenya.lucene.html

Interface Summary
HTMLParserConstants

Class Summary
Entities  
HtmlContentHandler  
HtmlDocument The HtmlDocument class creates a Lucene Document from an HTML document.
HTMLParser HTML parser.
HTMLParserTokenManager  
SimpleCharStream An implementation of the CharStream interface in which the stream is assumed to contain only ASCII characters (no Unicode processing).
Token Describes the input token stream.

Exception Summary
ParseException This exception is thrown when parse errors are encountered.

Error Summary
TokenMgrError

Copyright © 1999-2005 Apache Software Foundation. All Rights Reserved.