Package org.htmlparser.nodes


package org.htmlparser.nodes
The nodes package has the concrete node implementations.

The classes in this package all implement the Node interface. They individually implement the more specific interfaces for text, remarks and tags. The three node types are mapped as follows:

InterfaceConcrete Implementation
TextTextNode
RemarkRemarkNode
TagTagNode
These classes are derived from the AbstractNode class, which provides basic functionality common to all nodes, such as maintaining start and end positions and child and parent references. Only the TagNode objects contain a list of Attribute objects.

The Lexer parses an HTML stream into a contiguous stream of these nodes.

The Parser returns either these nodes or specific Tag objects (which are subclasses of TagNode) for tags with names that have been registered via registerTag().

  • Classes
    Class
    Description
    The concrete base class for all types of nodes (tags, text remarks).
    The remark tag is identified and represented by this class.
    TagNode represents a generic tag.
    Normal text in the HTML document is represented by this class.