Feeds

XML - past, present and future

Chewing the fat with MS's Jean Paoli

  • alert
  • submit to reddit

The Power of One eBook: Top reasons to choose HP BladeSystem

Last week I had the pleasure of meeting up with Jean Paoli of Microsoft. In November, Jean was presented with the XML Cup 2004 to recognise his lifelong work in XML and its precursor SGML. The meeting gave me an opportunity to hear about the fascinating history of XML and understand some of its importance to Microsoft and the industry.

Jean Paoli was one of the leading members of the original XML working party and he had been working with SGML since 1985. SGML was a mark-up language that was mainly designed to allow manufacturers to pass complex design documents around. It worked very well at that task but never found its way into the mainstream of computing. Its biggest problem was its size, the specification was about a thousand pages and there was only one parser that implemented the complete standard. The other problem was that it was document centric, rather than data centric.

When Jean joined Microsoft in April 1996, officially to help develop IE4, it was a good chance to put into practice ideas that had floated around the SGML community for several years. Jean helped set up the first W3C committee for XML and by the end of the year 80 per cent of the standard was complete. Jean found that his knowledge and understanding of the power of SGML and mark-up languages in general, combined with the Microsoft engineers’ passion and understanding of simplicity and ease of use, enabled him to define XML. The XML specification was less than five per cent the size of SGML but in many ways more powerful.

Defining XML was Jean’s night job and during the day he helped develop Internet Explorer 4.0. The two came together by XML support being included in IE4 when it was launched at the end of 97. This was the time of the IE-Netscape wars and that discussion rather overshadowed the really important new bit of IE that was the XML support. Included in IE4 was the implementation of CDF (the precursor of RSS) which was the first use of XML. The importance of CDF was that it showed the power of XML to transport data from one environment to another in such a way that the producers and consumers did not need to have any direct knowledge of each others environments.

The amazing thing about this story is the speed at which it happened; less than two years from a standards committee being set up, to product coming out in the market, is unusual. This happened because the requirement was well understood and Bill Gates recognised its importance and gave it his backing.

XML is now imbedded into most of Microsoft’s products and central to all of its strategy. And, as they say... the rest is history.

I asked Jean about WordML. When it was first announced, it seemed very Office-centric to me, and I felt that it should have been a more generalised document mark-up language. Jean explained that the raison d’etre for WordML is for archiving Word documents. There is a real problem with documents that have to be kept for a long time (think of birth certificates) if they are stored in internal Word format. The problem is that in 30 years' time they will probably be unreadable as the software will have moved on, let alone 100 years from now. So there is a need to be able to store these documents in a vendor and software neutral format and that is what WordML is designed to do. The schema definition is open source so that anyone can write a parser at any time to read and format the documents. To do this, WordML has to support all the functionality and the quirkiness of Word, and hence the WordML schema is by definition Word-centric.

On the other hand, what is more generally important is Offices’ support of any XML schema. This is an area that has quietly grown up and the first tech conference on the subject last week attracted more than 500 delegates.

© IT-analysis.com

Related stories

XML Tower of Babel - bring on UBL
EDS and Opsware: bringing XML to the data centre
XML machine the successor to von Neumann?

HP ProLiant Gen8: Integrated lifecycle automation

More from The Register

next story
HIDDEN packet sniffer spy tech in MILLIONS of iPhones, iPads – expert
Don't panic though – Apple's backdoor is not wide open to all, guru tells us
Do YOU work at Microsoft? Um. Are you SURE about that?
Nokia and marketing types first to get the bullet, says report
Microsoft takes on Chromebook with low-cost Windows laptops
Redmond's chief salesman: We're taking 'hard' decisions
Cheer up, Nokia fans. It can start making mobes again in 18 months
The real winner of the Nokia sale is *drumroll* ... Nokia
EU dons gloves, pokes Google's deals with Android mobe makers
El Reg cops a squint at investigatory letters
Chrome browser has been DRAINING PC batteries for YEARS
Google is only now fixing ancient, energy-sapping bug
Big Blue Apple: IBM to sell iPads, iPhones to enterprises
iOS/2 gear loaded with apps for big biz ... uh oh BlackBerry
prev story

Whitepapers

Reducing security risks from open source software
Follow a few strategies and your organization can gain the full benefits of open source and the cloud without compromising the security of your applications.
Consolidation: The Foundation for IT Business Transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.
Application security programs and practises
Follow a few strategies and your organization can gain the full benefits of open source and the cloud without compromising the security of your applications.
Boost IT visibility and business value
How building a great service catalog relieves pressure points and demonstrates the value of IT service management.
Consolidation: the foundation for IT and business transformation
In this whitepaper learn how effective consolidation of IT and business resources can enable multiple, meaningful business benefits.