SYNOPSIS

 use HTML::TokeParser::Simple;
 my $p = HTML::TokeParser::Simple->new( $somefile );

 while ( my $token = $p->get_token ) {
     # This prints all text in an HTML doc (i.e., it strips the HTML)
     next unless $token->is_text;
     print $token->as_is;
 }

DESCRIPTION

Processing instructions come from XML. This class is handy if you need to parse out PHP and similar embedded constructs with a parser.

Currently, there appear to be some problems with processing instructions. You can override this class if you need finer-grained handling of them.

is_pi() and is_process_instruction() both return true.
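As a rough illustration of the methods named above, the following sketch collects the raw text of every processing instruction in an HTML string. The helper name find_processing_instructions and the sample markup are hypothetical, not part of the module. It assumes the parser is given a scalar reference so that the HTML is read from memory. Because of the problems mentioned above, treat the exact token contents as something to verify on your own input.

```perl
use strict;
use warnings;
use HTML::TokeParser::Simple;

# Hypothetical helper (not part of the module): return the raw text
# of each processing-instruction token found in an HTML string.
sub find_processing_instructions {
    my ($html) = @_;

    # Passing a scalar reference asks the parser to read the HTML
    # from memory rather than from a file.
    my $p = HTML::TokeParser::Simple->new( \$html );

    my @found;
    while ( my $token = $p->get_token ) {
        # is_process_instruction() could be used here instead of is_pi().
        next unless $token->is_pi;
        push @found, $token->as_is;
    }
    return @found;
}

# Made-up sample input containing one PHP-style instruction.
my $html = '<p>Hello</p><?php echo 1; ?><p>Bye</p>';
print "$_\n" for find_processing_instructions($html);
```

Calling get_token0 on such a token instead of as_is should give the instruction's content without the surrounding markup.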

OVERRIDDEN METHODS

  • get_token0

  • is_pi

  • is_process_instruction

  • return_token0