Package org.jsoup.parser
Class HtmlTreeBuilder
- java.lang.Object
  - org.jsoup.parser.HtmlTreeBuilder
-
public class HtmlTreeBuilder extends java.lang.Object
HTML Tree Builder; creates a DOM from Tokens.
-
-
Field Summary
Fields
protected java.lang.String baseUri
protected org.jsoup.parser.Token currentToken
protected Document doc
static int MaxScopeSearchDepth
protected Parser parser
protected java.util.Map<java.lang.String,Tag> seenTags
protected ParseSettings settings
protected java.util.ArrayList<Element> stack
-
Constructor Summary
Constructors
HtmlTreeBuilder()
-
Method Summary
Methods
protected Element currentElement()
    Get the current element (last on the stack).
protected boolean currentElementIs(java.lang.String normalName)
    Checks if the Current Element's normal name equals the supplied name.
protected void error(java.lang.String msg)
    If the parser is tracking errors, add an error at the current position.
protected void error(java.lang.String msg, java.lang.Object... args)
    If the parser is tracking errors, add an error at the current position.
protected void initialiseParse(java.io.Reader input, java.lang.String baseUri, Parser parser)
protected boolean isContentForTagData(java.lang.String normalName)
    An internal method, visible for Element.
protected boolean process(org.jsoup.parser.Token token)
protected boolean processEndTag(java.lang.String name)
protected boolean processStartTag(java.lang.String name)
boolean processStartTag(java.lang.String name, Attributes attrs)
protected void runParser()
protected Tag tagFor(java.lang.String tagName, ParseSettings settings)
java.lang.String toString()
-
-
-
Field Detail
-
MaxScopeSearchDepth
public static final int MaxScopeSearchDepth
See Also: Constant Field Values
-
parser
protected Parser parser
-
doc
protected Document doc
-
stack
protected java.util.ArrayList<Element> stack
-
baseUri
protected java.lang.String baseUri
-
currentToken
protected org.jsoup.parser.Token currentToken
-
settings
protected ParseSettings settings
-
seenTags
protected java.util.Map<java.lang.String,Tag> seenTags
-
-
Method Detail
-
initialiseParse
@ParametersAreNonnullByDefault protected void initialiseParse(java.io.Reader input, java.lang.String baseUri, Parser parser)
-
process
protected boolean process(org.jsoup.parser.Token token)
-
toString
public java.lang.String toString()
Overrides: toString in class java.lang.Object
-
isContentForTagData
protected boolean isContentForTagData(java.lang.String normalName)
(An internal method, visible for Element. For HTML parse, signals that script and style text should be treated as Data Nodes).
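The rule described above can be pictured as a simple name test. The following is a minimal sketch under that reading, not the jsoup source: the class `TagDataSketch` and its constant are hypothetical names, and the argument is assumed to be an already lower-cased, non-null normal name.

```java
import java.util.Set;

// Hypothetical stand-in, not part of jsoup: illustrates the rule that the
// content of script and style elements is kept as raw data.
final class TagDataSketch {
    // Assumption taken from the description above: only these two tags qualify.
    private static final Set<String> DATA_TAGS = Set.of("script", "style");

    // normalName is assumed to be lower-cased and non-null.
    static boolean isContentForTagData(String normalName) {
        return DATA_TAGS.contains(normalName);
    }

    public static void main(String[] args) {
        System.out.println(isContentForTagData("script"));
        System.out.println(isContentForTagData("p"));
    }
}
```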
-
runParser
protected void runParser()
-
processStartTag
protected boolean processStartTag(java.lang.String name)
-
processStartTag
public boolean processStartTag(java.lang.String name, Attributes attrs)
-
processEndTag
protected boolean processEndTag(java.lang.String name)
-
currentElement
protected Element currentElement()
Get the current element (last on the stack). If all items have been removed, returns the document instead (which might not actually be on the stack; use stack.size() == 0 to test if required).
Returns: the last element on the stack, if any; or the root document
-
currentElementIs
protected boolean currentElementIs(java.lang.String normalName)
Checks if the Current Element's normal name equals the supplied name.
Parameters: normalName - name to check
Returns: true if there is a current element on the stack, and its name equals the supplied name
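The documented behaviour of currentElement() and currentElementIs() can be illustrated with a small stand-alone model. This is a sketch under stated assumptions, not the jsoup implementation: `OpenElementStackSketch`, `DOCUMENT`, `push` and `pop` are hypothetical, and elements are reduced to their normal names as plain strings.

```java
import java.util.ArrayList;

// Hypothetical miniature of the open-element stack; not a jsoup class.
final class OpenElementStackSketch {
    // Stand-in for the root document, which is never pushed onto the stack here.
    static final String DOCUMENT = "#document";

    private final ArrayList<String> stack = new ArrayList<>();

    void push(String normalName) {
        stack.add(normalName);
    }

    String pop() {
        return stack.remove(stack.size() - 1);
    }

    // Last entry on the stack, or the document stand-in when the stack is empty.
    String currentElement() {
        int size = stack.size();
        return size > 0 ? stack.get(size - 1) : DOCUMENT;
    }

    // True only if the stack is non-empty and its last entry has the given name.
    boolean currentElementIs(String normalName) {
        return !stack.isEmpty() && stack.get(stack.size() - 1).equals(normalName);
    }

    public static void main(String[] args) {
        OpenElementStackSketch s = new OpenElementStackSketch();
        System.out.println(s.currentElement());
        s.push("html");
        s.push("body");
        System.out.println(s.currentElementIs("body"));
    }
}
```

In this model the name check returns false on an empty stack even though currentElement() still returns a value, which matches the "if there is a current element on the stack" wording of the return description.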
-
error
protected void error(java.lang.String msg)
If the parser is tracking errors, add an error at the current position.
Parameters: msg - error message
-
error
protected void error(java.lang.String msg, java.lang.Object... args)
If the parser is tracking errors, add an error at the current position.
Parameters: msg - error message template; args - template arguments
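The two error overloads follow a common pattern: record a message only while tracking is enabled, and expand a template with its arguments in the varargs form. The sketch below shows that pattern in isolation. Everything in it is an assumption for illustration: `ErrorTrackingSketch`, the `maxErrors` limit, the use of `String.format` for the template, and the "position: message" layout are not taken from this page.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical error collector; not a jsoup class.
final class ErrorTrackingSketch {
    private final List<String> errors = new ArrayList<>();
    private final int maxErrors; // assumption: 0 means tracking is off
    private int position;        // assumption: a simple character offset

    ErrorTrackingSketch(int maxErrors) {
        this.maxErrors = maxErrors;
    }

    void setPosition(int position) {
        this.position = position;
    }

    private boolean canAddError() {
        return errors.size() < maxErrors;
    }

    // Plain message: stored as-is, so '%' characters are not interpreted.
    void error(String msg) {
        if (canAddError()) errors.add(position + ": " + msg);
    }

    // Template message: args are substituted with String.format.
    void error(String msg, Object... args) {
        if (canAddError()) errors.add(position + ": " + String.format(msg, args));
    }

    List<String> errors() {
        return errors;
    }

    public static void main(String[] args) {
        ErrorTrackingSketch tracker = new ErrorTrackingSketch(10);
        tracker.setPosition(5);
        tracker.error("Unexpected token [%s]", "</p>");
        System.out.println(tracker.errors());
    }
}
```

Keeping the single-argument overload separate means a literal message containing '%' is never passed through the formatter.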
-
tagFor
protected Tag tagFor(java.lang.String tagName, ParseSettings settings)
-
-