Package org.htmlparser.visitors
Class TextExtractingVisitor
java.lang.Object
org.htmlparser.visitors.NodeVisitor
org.htmlparser.visitors.TextExtractingVisitor
Extracts text from a web page.
Usage:
Parser parser = new Parser(...);
TextExtractingVisitor visitor = new TextExtractingVisitor();
parser.visitAllNodesWith(visitor);
String textInPage = visitor.getExtractedText();
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidvisitEndTag(Tag tag) Called for eachTagvisited that is an end tag.voidvisitStringNode(Text stringNode) Called for eachStringNodevisited.voidCalled for eachTagvisited.Methods inherited from class org.htmlparser.visitors.NodeVisitor
beginParsing, finishedParsing, shouldRecurseChildren, shouldRecurseSelf, visitRemarkNode
-
Constructor Details
-
TextExtractingVisitor
public TextExtractingVisitor()
-
-
Method Details
-
getExtractedText
-
visitStringNode
Description copied from class:NodeVisitorCalled for eachStringNodevisited.- Overrides:
visitStringNodein classNodeVisitor- Parameters:
stringNode- The string node being visited.
-
visitTag
Description copied from class:NodeVisitorCalled for eachTagvisited.- Overrides:
visitTagin classNodeVisitor- Parameters:
tag- The tag being visited.
-
visitEndTag
Description copied from class:NodeVisitorCalled for eachTagvisited that is an end tag.- Overrides:
visitEndTagin classNodeVisitor- Parameters:
tag- The end tag being visited.
-