Joho the Blog
An Entry from the Archives

« [etech] Day 2 - Tom Igoe - Net Objects || Back to Blog | [etech] Programme Information Pages »

March 16, 2005

[etech] Clay Shirky: Ontologies and Tags

Clay talks about how taxonomies always have values built in. Even the periodic table's "noble gases" division reflects an assumption about the "essential" state of elements. He points to the Dewey Decimal System's skewed religion category. [Yikes! I've been doing that, too! I probably heard it from Clay first. I will attribute it from now on. Ack!] Even the Library of Congress puts the Balkan Peninsula and African on equal footing because it's measuring the number of books on the shelves. The categorization reflects not the ideas but the physical storage.

He points out, that even though Yahoo has cross links, it has a concept of which is the real categorization; it only shows you the number of links in a category if you are seeing the category in its "real" environment.

Hierarchical systems, he says, inevitably adds hyperlinks to cut across the taxonomy. (Yahoo only allows three, he says, because they didn't want to be spammed.) If you have enough links, you don't need the hierarchy. "There is no shelf."

When does ontological organization work? When you don't have a lot of stuff, it's stable, things have clear edges, an authoritative source and trained users. I.e., the opposite of the Web.

People have assumed that tags that mean the same thing are actually the same, but (Clay says) "movie" people don't want to hang out with "cinema" people, and "queer" people certainly don't want to hang out with "homosexual" people. There is information in the differences that thesauruses and categorization schemes miss.

He shows graphs that provide evidence that tagging forms power laws — who tags, and the tags that individuals use.

Organic categorization uses market logic, merged from URLs not categories. The categories overlap, not synch. The mergers are probabilistic, not binary. User and time are core attributes; you can do grouping, inclusion/exclusion, and decay. The semantics are in the users, not in the system; this is not a way to get computers to understand one another.

[Great talk. As always.] [I spoke with Clay afterwards and he talked about Dewey two years ago during an etech talk I must have been at. I apologized. He was, of course, gracious. I will be adding the phrase, "As Clay says" to yet more of what I say. Happily.] [Technorati tags: etech shirky taxonomy]

Posted by D. Weinberger at March 16, 2005 07:22 PM


TrackBack

Listed below are links to weblogs that reference [etech] Clay Shirky: Ontologies and Tags:

» Blogging from BlogSoCool
Do you think blogging is cool?? [Read More]

Tracked on March 16, 2005 08:26 PM

» David's note on Clay Shirky: Ontologies and Tags from RatcliffeBrowse
Joho the Blog: [etech] Clay Shirky: Ontologies and Tags: He shows graphs that provide evidence that tagging forms power laws — who tags, and the tags that individuals use. Organic categorization uses market logic, merged from URLs not categories.... [Read More]

Tracked on March 19, 2005 12:03 AM

» SXSW Recap... from Epistemographer
Spent the past few days mulling over the heady experience of South by Southwest Interactive…I’ve been thinking out loud to a few friends in the time since I got back on Wed. night, and I think I’ve processed enough to... [Read More]

Tracked on March 21, 2005 02:27 AM

» SXSWi Recap... from Epistemographer
Spent the past few days mulling over the heady experience of South by Southwest Interactive…I’ve been thinking out loud to a few friends in the time since I got back on Wed. night, and I think I’ve processed enough to... [Read More]

Tracked on March 21, 2005 02:34 AM

Comments

The part of Dewey that I find risible is that they have these top-level numbers:

137 Divinatory graphology
139 Phrenology

The structure of the 400s is rather telling too...

Posted by: Kevin Marks | March 16, 2005 08:27 PM


Borges makes much the same point in the same essay where he goes through the supposed Chinese classification of animals:

The Bibliographic Institute of Brussels exerts chaos too: it has divided the universe into 1000 subdivisions, from which number 262 is the pope; number 282, the Roman Catholic Church; 263, the Day of the Lord; 268 Sunday schools; 298, mormonism; and number 294, brahmanism, buddhism, shintoism and taoism.

[http://www.alamut.com/subj/artiface/language/johnWilkins.html]

Posted by: oedipa | March 17, 2005 03:44 AM


Librarians have been criticizing the skewed categories in Dewey for decades. That's why the LC taxonomy was developed. I don't see why you have to "attribute" Clay when it's a truism.

Posted by: K.G. Schneider | March 19, 2005 06:40 PM


Clay's point about the Balkan Peninsula, Africa, and Asia in LCSH makes a similar point, both of which are artifacts stemming from flawed premises about hierarchical arrangements and temporality.

Posted by: Chris L | March 24, 2005 06:56 PM


Post a comment

Guidelines for Commenting

Basically, you can say what you want. (Click here for the fine print.)

If you haven't left a comment here before, your comment may be put into a queue for me to approve. Sorry for the delay. Blame the damn spammers.