The choice of clear text for the storage of XML documents, rather than any binary format, is important, if controversial for some. A binary solution would have been more efficient from a machine-processing point of view and would allow much smaller files. Having XML documents in plain text, however, means that users can intervene manually to examine or edit them if they wish. Keeping both the content and the containing markup as plain text allows users and programs alike to identify easily not only the structures foreseen by an author but also the intention of those structures.
In reality, many XML systems – whether processing or storing XML – use proprietary binary formats internally to improving operational efficiency. XML in its native plain text form is most useful when exchanging information between systems.
As we saw in the opening chapter, there is no longer a premium on ‘terseness’, so the added verbosity of XML is not a problem from the point of view of storage. Literally ‘spelling things out’ is the key to XML’s role in interoperability between information systems.
Posted in: Uncategorized | No Comments »