Monday, April 23, 2007

Metadata: The Eternally Red Herring

KML's shiny <Metadata> tag is the toast of the town, but I think the GIS crowd drank a little too much ESRI Kool-Aid this decade to realize the implications. Paul Ramsey writes:

Where will this all end? I think it will end with the Google Team picking one or a few <Metadata> encodings to expose in their user interfaces (Earth and Maps). At that point all content will converge rapidly on that encoding, and the flexibility of <Metadata> will be rapidly ignored.

The very word "metadata" reeks of sadness. It speaks of librarians locked in dark rooms trying to figure out what data means, long after those who created it are onto the next burning crisis. It smells like scores of sad committee meanings hashing out arcane XML details that no one will ever write a consistent reader or writer for.

It is amazing to me that anyone would want to use the word for anything anymore. But Google did, and people took them at their word. But what Google really meant — in traditional GIS speak — is attribute storage. As they point out in their FAQ article:

Without the <Metadata> tag, KML authors had to either send two files, one with metadata and one with KML, or place their data in the <description> tag

Sound familiar? It's a SHP and DBF file. But now you can marry attribute data to your geometries in one file. And because it can be arbitrary XML, it can be a lot more interesting than boring rows of attributes.

Perhaps only a few big encodings (like a boring row of attributes) will be supported out of the box by Google Earth, but you'll find every XML spec under the sun suddenly glued inside KML documents, and the great sleeping beast of a thousand enterprise applications will suddenly awake from its slumber and devour GIS applications everywhere. Whether you put KML inside a more traditional XML document, or XML inside your KML, it comes to the same thing. It's a smarter, more modern shape file. It's not the usual GIS "metadata" malarkey — it's the real data.

3 Comments:

At April 24, 2007 10:04 AM, Paul Ramsey said...

Sebastian, the sadness is not because I think that Metadata is, well metadata, but because the lesson of GML is clear -- if you make a tool flexible enough to contain absolutely anything, it will end up usefully containing absolutely nothing. Jason's comment on my blog was revealing -- Schema was the shapefile, and it has been deprecated in favour of... whatever Metadata becomes. Which might, in the end, just be the shapefile again.

 
At April 24, 2007 11:27 AM, Sebastian Good said...

I see your point, Paul. I guess my assumption is that with KML actually being deployed on a huge platform there is a chance that the sort of XML-y goodness that might have happened with GML will actually happen. Perhaps the analogy is with SGML and HTML. HTML is but a pale shadow of SGML, but outside of a very specific community, no one uses SGML. And it's for no other reason than Tim Berners Lee decided to do HTML instead.

 
At April 24, 2007 12:50 PM, Jeremy Cothran said...

Generally I agree with Paul's comment that the Metadata tag hopefully acts as a transitional placeholder for a handful of later popular xml encodings(for my work in regards to observation oriented data). Until then I'll gnash my teeth as yet another possibly useful KML data source gets hopelessly buried in the presentation details, rising only for other similar end-user incarnations which are equally unshareable at the machine level(lack of consistent xml schemas or controlled vocabularies).

 

Post a Comment

Links to this post:

Create a Link

<< Home