/blog/Markup languages: who’s who?

markup languages timelineIs HTML XML?  This question came up in a conversation with Sarah and @k8lin, and ended up being harder than I thought it’d be.  There seems to be a fair amount of confusion on the topic, especially given the W3C’s recent abandonment of XHTML 2.0 and growing use of HTML5.

So, I decided to lay it all out in a (relatively) simple timeline format; as far as I know, this doesn’t exist anywhere else.  You’re welcome, The Internet.  Below are my sources and some notes; where possible, links are to the original recommendations or RFCs:

SGML is an ISO standard from the 80′s.  Unlike the other standards on this list, it’s not open (the ISO sells copies for >$200).  HTML is an “SGML application“, and has been from the beginning. The Wikipedia article has a lot more information on its origins, as does the W3C.

HTML 2.0 and HTML 3.2 , the first two W3C specs, are both pretty straightforward. Also straightforward is XML, which dropped in February 1998. Like HTML, XML is “an application profile…of SGML.”

In December 1999, the HTML 4.01 recommendation came out, followed a month later by XHTML 1.0.  The important thing to note is that both of these are still HTML 4; however, XHTML is “a reformulation of HTML 4 as an XML 1.0 application,” while HTML 4.01 is still plain ol’ SGML.

No one knows yet exactly what HTML5 is going to look like, as it’s still several years off.  However, the W3C tells us that the HTML5, like HTML4, is going to have two different “serializations.”  One will be an XML syntax, and is currently being called XHTML 5 (wait, why not “XHTML 2?”  Hang on, we’ll get there). You might expect that the other serialization would be SGML a la HTML 4.01.  You’d be wrong.

Although HTML is technically SGML, most browsers and authoring tools couldn’t care less about the broader SGML standard; they just implement HTML.  So the W3C’s plan seems to be to ditch the SGML legecy and replace it with “html” (note the lowercase), an entirely new standard…which happens to look pretty much like HTML has always looked.

Whew, we’re almost done.  OK, what about XHTML2?  Despite the name, project was not a “next step;” it was a huge break with the whole HTML/XHTML tradition, an effort to completely remake web markup.  In July, the W3C decided to let it die on the vine and focus on HTML5.  So XHTML 5, with its HTML lineage, will be a more incremental change than XHTML 2 would’ve been.

There you have it.  If I missed anything or got something turned around, let me know.

Post a Comment

(required)
(required; never shared)