Class TagTokenizer


  • public class TagTokenizer
    extends Object
    Splits a chunk of HTML into 'text' and 'tag' tokens, for easy processing. Is VERY tolerant to badly formed HTML.

    Usage

    You need to supply a custom TokenHandler that will receive callbacks as text and tags are processed.

    char[] input = ...;
     HTMLTagTokenizer tokenizer = new HTMLTagTokenizer(input);
     TokenHandler handler = new MyTokenHandler();
     tokenizer.start(handler);
    Author:
    Joe Walnes
    See Also:
    TokenHandler, HTMLPageParser
    • Field Detail

      • input

        private final char[] input
    • Constructor Detail

      • TagTokenizer

        public TagTokenizer​(char[] input)
      • TagTokenizer

        public TagTokenizer​(String input)