SYNOPSIS

 use HTML::TokeParser::Simple;
 my $p = HTML::TokeParser::Simple->new( $somefile );

 while ( my $token = $p->get_token ) {
     # This prints all text in an HTML doc (i.e., it strips the HTML)
     next unless $token->is_text;
     print $token->as_is;
 }

DESCRIPTION

This class does most of the heavy lifting for \*(C`HTML::TokeParser::Simple\*(C'. See the \*(C`HTML::TokeParser::Simple\*(C' docs for details.

OVERRIDDEN METHODS

  • as_is

  • get_tag

  • is_end_tag

  • is_tag

  • return_text

  • rewrite_tag