SYNOPSIS

 use XML::PatAct::ToObjects;

 my $patterns = [ PATTERN => [ OPTIONS ],
                  PATTERN => "PERL-CODE",
                  ... ];

 my $matcher = XML::PatAct::ToObjects->new( Patterns => $patterns,
                                            Matcher => $matcher,
                                            CopyId => 1,
                                            CopyAttributes => 1 );

DESCRIPTION

XML::PatAct::ToObjects is a PerlSAX handler for applying pattern-action lists to \s-1XML\s0 parses or trees. XML::PatAct::ToObjects creates Perl objects of the types and contents of the action items you define.

New XML::PatAct::ToObject instances are creating by calling `new()'. Parameters can be passed as a list of key, value pairs or a hash. `new()' requires the Patterns and Matcher parameters, the rest are optional:

Patterns

The pattern-action list to apply.

Matcher

An instance of the pattern or query matching module.

CopyId

Causes the `\s-1ID\s0' attribute, if any, in a source \s-1XML\s0 element to be copied to an `\s-1ID\s0' attribute in newly created objects. Note that IDs may be lost of no pattern matches that element or an object is not created (\*(C`-make\*(C') for that element.

CopyAttributes

Causes all attributes of the element to be copied to the newly created objects.

Each action can either be a list of options defined below or a string containing a fragment of Perl code. If the action is a string of Perl code then simple then some simple substitutions are made as described further below.

Options that can be used in an action item containing an option-list:

-holder

Ignore this element, but continue processing it's children (compare to -ignore). \*(C`-pcdata\*(C' may be used with this option.

-ignore

Ignore (discard) this element and it's children (compare to -holder).

-pcdata

Character data in this element should be copied to the \*(C`Contents\*(C' field.

-make \s-1PACKAGE\s0

Create an object blessed into \s-1PACKAGE\s0, and continue processing this element and it's children. \s-1PACKAGE\s0 may be the type `\*(C`HASH\*(C'' to simply create an anonyous hash.

-args \s-1ARGUMENTS\s0

Use \s-1ARGUMENTS\s0 in creating the object specified by -make. This is commonly used to copy element attributes into fields in the newly created object. For example: -make => 'HASH', -args => 'URL => %{href}' would copy the `\*(C`href\*(C'' attribute in an element to the `\*(C`URL\*(C'' field of the newly created hash.

-field \s-1FIELD\s0

Store this element, object, or children of this element in the parent object's field named by \s-1FIELD\s0.

-push-field \s-1FIELD\s0

Similar to -field, except that \s-1FIELD\s0 is an array and the contents are pushed onto that array.

-value \s-1VALUE\s0

Use \s-1VALUE\s0 as a literal value to store in \s-1FIELD\s0, otherwise ignoring this element and it's children. Only valid with -field or -push-field. `\*(C`%{ATTRIBUTE}\*(C'' notation can be used to substitute the value of an attribute into the literal value.

-as-string

Convert the contents of this element to a string (as in \*(C`XML::Grove::AsString\*(C') and store in \s-1FIELD\s0. Only valid with -field or -push-field.

-grove

Copy this element to \s-1FIELD\s0 without further processing. The element can then be processed later as the Perl objects are manipulated. Only valid with -field or -push-field. If ToObjects is used with PerlSAX, this will use XML::Grove::Builder to build the grove element.

-grove-contents

Used with -make, -grove-contents creates an object but then takes all of the content of that element and stores it in Contents.

If an action item is a string, that string is treated as a fragment of Perl code. The following simple substitutions are performed on the fragment to provide easy access to the information being converted: The object that caused this action to be called. If ToObjects is used with PerlSAX this will be a hash with the element name and attributes, with XML::Grove this will be the element object, with Data::Grove it will be the matching object, and with \s-1XML::DOM\s0 it will be an XML::DOM::Element.

EXAMPLE

The example pattern-action list below will convert the following \s-1XML\s0 representing a Database schema:

<schema> <table> <name>MyTable</name> <summary>A short summary</summary> <description>A long description that may contain a subset of HTML</description> <column> <name>MyColumn1</name> <summary>A short summary</summary> <description>A long description</description> <unique/> <non-null/> <default>42</default> </column> </table> </schema>

into Perl objects looking like:

[ { Name => "MyTable", Summary => "A short summary", Description => $grove_object, Columns => [ { Name => "MyColumn1", Summary => "A short summary", Description => $grove_object, Unique => 1, NonNull => 1, Default => 42 } ] } ]

Here is a Perl script and pattern-action list that will perform the conversion using the simple name matching pattern module XML::PatAct::MatchName. The script accepts a Schema \s-1XML\s0 file as an argument ($ARGV[0]) to the script. This script creates a grove as one of it's objects, so it requires the XML::Grove module.

use XML::Parser::PerlSAX; use XML::PatAct::MatchName; use XML::PatAct::ToObjects;

my $patterns = [ 'schema' => [ qw{ -holder } ], 'table' => [ qw{ -make Schema::Table } ], 'name' => [ qw{ -field Name -as-string } ], 'summary' => [ qw{ -field Summary -as-string } ], 'description' => [ qw{ -field Description -grove } ], 'column' => [ qw{ -make Schema::Column -push-field Columns } ], 'unique' => [ qw{ -field Unique -value 1 } ], 'non-null' => [ qw{ -field NonNull -value 1 } ], 'default' => [ qw{ -field Default -as-string } ], ];

my $matcher = XML::PatAct::MatchName->new( Patterns => $patterns ); my $handler = XML::PatAct::ToObjects->new( Patterns => $patterns, Matcher => $matcher);

my $parser = XML::Parser::PerlSAX->new( Handler => $handler ); my $schema = $parser->parse(Source => { SystemId => $ARGV[0] } );

TODO

  • It'd be nice if patterns could be applied even in -as-string and -grove.

  • Implement Perl code actions.

  • -as-xml to write \s-1XML\s0 into the field.

AUTHOR

Ken MacLeod, [email protected]

RELATED TO XML::PatAct::ToObjects…

perl\|(1), Data::Grove\|(3)

``Using PatAct Modules'' and ``Creating PatAct Modules'' in libxml-perl.