by Thomas Erl
 XML Data Strategy
divider
 Meta-Enable Your
 Enterprise
Despite the many ways XML has been utilized, its core purpose has always been to help the Internet separate presentation from content. By its very nature as a meta language, XML supplies a framework for the creation of custom meta tags.
Aside from multi-media content, a standard Web page generally consists of HTML tags that supply formatting and layout commands to the browser, and ASCII text that makes up the Web page content. XML tags add intelligence to Web page content, by defining a document structure (such as an invoice), and by identifying and classifying each piece of relevant text within the document body (such as the invoice number, customer name, invoice amount, etc.).
Theoretically, all documents within an organization could be XML-enabled, providing a universal standard classification and categorization system. The most immediate benefit would be the addition of advanced searching capabilities, which could substantially streamline the process of locating information.
Enhancing the quality of corporate content this way, however, will ultimately lead to a superior document-sharing environment with advanced document management capabilities. The accessibility and programmability of these documents would be comparable to (and even interoperable with) information stored in traditional relational databases (as explained in Unifying Corporate Data and Documents).
Using XML this way is not a huge leap of faith anymore. The technology is part of the mainstream, simple in nature, and produced by the same standards organization responsible for SGML (the parent language of HTML and XML).
It is the actual implementation within an enterprise, however, where the challenges lie. Standardizing on XML as the document format for all published documents will significantly affect many parts of an organization, and will undoubtedly require a large and expensive conversion effort. Here are some of the major issues to look out for:

Vocabulary and Schema Authoring
Perhaps the most important factor to consider when introducing XML on any level, is the creation and maintenance of XML vocabularies, their associated document structures, and the encapsulation of corporate business rules in DTDs or Schemas. In larger organizations, this responsibility can evolve into a new role: the XML Data Custodian.

Publishing Tools
New tools will likely be required in order to allow non-technical users to convert existing documents or author new documents in the XML format. XML editing and publishing products come in all shapes and sizes. The key is to find a set of tools that are easy to use, yet still robust enough to handle large volumes of documents, batch processing, and multi-document meta tagging. It is also wise to settle on a set of tools compatible with your existing technical environment, as it may become necessary to integrate these products with your existing applications, or perhaps programmatically automate portions of the conversion process.

Publishing Process
Any existing publishing process you may have will need to be altered in order to incorporate new publishing steps based on the use of new editing and publishing tools. If your organization uses a centralized model to manage the publishing processes for your intranet, for instance, the impact of XML will be limited to publishers (as opposed to users) having to learn new editing and publishing tools.
 

return to homepage


SOA: Principles
of Service Design

by Thomas Erl

An in-depth guide dedicated to service engineering with a thorough exploration of the design principles that comprise the service-orientation design paradigm (including a comparison with object-orientation).


Service-Oriented Architecture:
Concepts, Technology, and Design

by Thomas Erl

The first "how-to" guide to building SOA, providing coverage of WS-* specifications, .NET and J2EE platforms, and step-by-step processes for service-oriented analysis and design.
Service-Oriented Architecture:
A Field Guide
to Integrating
XML and Web Services

by Thomas Erl

The best-selling guide to service-oriented integration, providing hundreds of integration strategies and over sixty best practices.

For more information about either book, visit: www.soabooks.com

About SOA Systems

SOA Systems Inc. provides strategic SOA consulting services and offers a comprehensive SOA training program.

For more information see:

•  www.soasystems.com

•  www.soatraining.com

   
   

Corporate Intranet Upgrade
In order to support the .xml document format and take advantage of the new searching and document management features introduced by the XML format, your intranet may need to undergo an upgrade. Also, because an XML document adds meta data to the document content, it may result in a requirement for increased storage space.

Existing Document Formats
Another major obstacle to establishing XML as a standard document format within some organizations, is that a standard document format may already exist. MS Word, for example, may be the most popular document format in the world. Experts have speculated, however, that it is unlikely that Microsoft will ever want Word to fully support an open XML format, as it would lessen the strategic value of its proprietary .doc format. This will potentially increase the effort of converting Word documents and may require the involvement of third-party tools.

Deciding on XML as your standard document format is a pretty safe bet, as it is highly unlikely that another meta-enabling technology will surface and succeed XML as the industry-standard meta document format during the next decade.
Adopting XML as the standard format for documents within your organization, however, will require a significant financial investment, as well as a great deal of effort. The benefits, though mostly strategic, are likely to reward you for years to come; as far as information access goes, XML is as good as it gets.

   
   
Home   Copyright © 1999-2007 SOA Systems Inc.