SYNOPSIS

This is an article, not a module.

SUMMARY

MKDoc is a web content management system written in Perl which focuses on standards compliance, accessiblity and usability issues, and multi-lingual websites.

At MKDoc Ltd we have decided to gradually break up our existing commercial software into a collection of completely independent, well-documented, well-tested open-source \s-1CPAN\s0 modules.

Ultimately we want MKDoc code to be a coherent collection of module distributions, yet each distribution should be usable and useful in itself.

MKDoc::XML is part of this effort.

You could help us and turn some of MKDoc's code into a \s-1CPAN\s0 module. You can take a look at the existing code at http://download.mkdoc.org/.

If you are interested in some functionality which you would like to see as a standalone \s-1CPAN\s0 module, send an email to <[email protected]>.

DISCLAIMER

MKDoc::XML is a low level \s-1XML\s0 library.
MKDoc::XML::* modules do not make sure your \s-1XML\s0 is well-formed.
MKDoc::XML::* modules can be used to work with somehow broken \s-1XML\s0.
MKDoc::XML::* modules should not be used as high-level parsers with general purpose \s-1XML\s0 unless you know what you're doing.

WHAT'S IN THE BOX

\s-1XML\s0 tokenizer

MKDoc::XML::Tokenizer splits your \s-1XML\s0 / \s-1XHTML\s0 files into a list of MKDoc::XML::Token objects using a single regex.

\s-1XML\s0 tree builder

MKDoc::XML::TreeBuilder sits on top of MKDoc::XML::Tokenizer and builds parsed trees out of your \s-1XML\s0 / \s-1XHTML\s0 data.

\s-1XML\s0 stripper

MKDoc::XML::Stripper objects removes unwanted markup from your \s-1XML\s0 / \s-1HTML\s0 data. Useful to remove all those nasty presentational tags or 'style' attributes from your \s-1XHTML\s0 data for example.

\s-1XML\s0 tagger

MKDoc::XML::Tagger module matches expressions in \s-1XML\s0 / \s-1XHTML\s0 documents and tag them appropriately. For example, you could automatically hyperlink certain glossary words or add <abbr> tags based on a dictionary of abbreviations and acronyms.

\s-1XML\s0 entity decoder

MKDoc::XML::Decode is a pluggable, configurable entity expander module which currently supports html entities, numerical entities and basic xml entities.

\s-1XML\s0 entity encoder

MKDoc::XML::Encode does the exact reverse operation as MKDoc::XML::Decode.

\s-1XML\s0 Dumper

MKDoc::XML::Dumper serializes arbitrarily complex perl structures into \s-1XML\s0 strings. It is also able of doing the reverse operation, i.e. deserializing an \s-1XML\s0 string into a perl structure.

AUTHOR

Copyright 2003 - MKDoc Holdings Ltd.

Author: Jean-Michel Hiver

This module is free software and is distributed under the same license as Perl itself. Use it at your own risk.

RELATED TO MKDoc::XML…

  Petal: http://search.cpan.org/dist/Petal/
  MKDoc: http://www.mkdoc.com/

Help us open-source MKDoc. Join the mkdoc-modules mailing list:

[email protected]