Joho the Blog
An Entry from the Archives

« Amazon's odd words || Back to Blog | Fun with anagrams »

March 18, 2005

[etech] My birds of a feather on taxonomy and tags

One last report on etech. I led a "birds of a feather" discussion on Tuesday night, a circle-the-chairs sort of thing for thirty people. The topic was "From Trees to Tags," and I began with about 12 minutes on the topic. This is from the speaking notes I took into the session, so what follows is highly approximate. It's also highly redundant with other blog posts of mine:

I'm writing a book called "Everything Is Miscellaneous," so I convened this BoF selflishly, in order to use you and what you know.

One way of getting at what the book is about is to ask: Why is knowledge organized the way the real world is? It's not at all obvious that we should organize our ideas the way we organize our socks.

We've organized knowledge into trees, from Aristotle to Linnaeus to Dewey. You get a tree by doing the basic thing of lumping and splitting, and then splitting the lumps until you get to a lump that is too unitary or miscellaneous to bear any more splitting. But lumping and splitting has been constrained by physical limitations. For example:

1. A thing has to go in one pile or another. For Aristotle, this was expressed as the Law of Identity (A is A and A is not not-A), a pretty basic rule.

2. The way we lump and split is the same for everyone: If you own a clothing store and separate it into men's and women's departments, it's separated that way for everyone who enters.

3. The lumping and splitting is done by experts.

4. The person who owns the stuff also owns the organization of the stuff. You can't come into the clothing store and rearrange it the way that suits you.

5. Lumping and splitting results in a neat and clean order. It's clean-edged.

But now we're digitizing information, resulting in a third order of order in which we break the rules of real-world order:

1. Things can go in more than one pile - You put your e-store's hiking boots under shoes, men's and women's apparel, outdoor wear, popular items, items on sale, etc.

2. The arrangement can be different for each person.

3. You or your social group are the experts.

4. Users get to control the organization of the stuff.

5. Messiness is a virtue on the Web.

You can see much of this in the rise of tagging: Users create the metadata and anyone can figure out how to sort through it and organize it. It's out of the hands of the owners of the stuff being classified.

So, what I'm saying is that we're moving from thinking that the right way to arrange — and understand — things is to figure out the taxonomic tree ahead of time. Instead, make a big pile of leaves, each with lots of metadata, and allow users to add more metadata and to sort and categorize it as they need.

But there are problems with this, especially with regard to tags:

- One word can have many meanings, and one meaning can have many words. As tagging gets more popular, that'll be a bigger issue.

- If we form social groups based around how we use words, we run the risk of fragmenting ourselves further, this time around semantics.

- Folksonomies can reinforce homogeneity.

But, I'm hopeful. Ask why tagging is happening now. After all, we've been able to tag Word documents forever, but we don't. Why now? I think in part it's because we are tagging not just for ourselves. We're doing it socially, aware that we're making the Web better for others. Together we're building a new infrastructure of meaning, created by and for one another. We will figure out amazing things to do with this new social semantic construct. (Unfortunately, I didn't take notes during the discussion. Drat!) [Technorati tags: tags etech taxonomy]

Posted by D. Weinberger at March 18, 2005 03:37 PM


TrackBack

Listed below are links to weblogs that reference [etech] My birds of a feather on taxonomy and tags:

» David Weinberger on taxonomy and tags from Vidar Hokstad's random musings
David Weinberger has posted som short noted from his birds of a feather on taxonomy and tags from etech that's well worth a read. They sum up quite simply the differences between "real world" taxonomies, structured around the concept of... [Read More]

Tracked on March 19, 2005 07:45 AM

» Debunking the new ideologues from alex wright
Gene Smith has written an excellent critique of Clay Shirky's Ontology is Overrated thesis, arguing adroitly why these increasingly bold pronouncements about the significance of tagging are so troubling: the underlying trend towards ideology. At times ... [Read More]

Tracked on April 20, 2005 03:59 PM

» Debunking the new ideologues from alex wright
Gene Smith has written an excellent critique of Clay Shirky's Ontology is Overrated thesis, arguing adroitly why these increasingly bold pronouncements about the significance of tagging are so troubling: the underlying trend towards ideology. At times ... [Read More]

Tracked on April 20, 2005 04:05 PM

» Debunking the new ideologues from alex wright
Gene Smith has written an excellent critique of Clay Shirky's Ontology is Overrated thesis. While I'm as burned out as anyone by now with windbagging about tags and folksonomies, I do think Gene makes several refreshingly new points here. In... [Read More]

Tracked on April 20, 2005 06:17 PM

» Tags and ideology from alex wright
Gene Smith has written an excellent critique of Clay Shirky's Ontology is Overrated thesis. While I'm as burned out as anyone by now with windbagging about tags and folksonomies, I do think Gene makes several refreshingly new points here. In... [Read More]

Tracked on April 22, 2005 01:40 AM

» Tags as ideology from alex wright
Gene Smith has written an excellent critique of Clay Shirky's Ontology is Overrated thesis. While I'm as burned out as anyone by now with windbagging about tags and folksonomies, I do think Gene makes several refreshingly new points here. In... [Read More]

Tracked on April 22, 2005 01:41 AM

Comments

I wonder if Clay Shirky was in the circle and if you have substantial agreement or dis- regarding ontology?

I wonder if tags might not be less interesting than the label "folksonomy" makes them? Matt Mullenweg said, "I’ve had a couple of people ask how best to tag me in a photo, I’ve seen some use “mullenweg” or “matt” or “mattmullenweg” but the best bet (and the one I watch) is “photomatt“."

The researcher who MUST assemble snapshots of Matt will - if her IQ is anywhere above room temperature - infer these tags without Matt's direction. Over his lifetime, a long one I hope, there may be a convergence to "photomatt" if he becomes THE photomatt. There's another photomatt (Matthew Gilbert) who owns the .com URI. Mullenweg is the .net guy. Or, who knows, maybe the folks of 2020 will tag him as WordPresser... or something. The point is, this is all soft and loose and the discipline of semantic structural analysis and kraftwerk leading therefrom isn't something that the pop-techny community is putting a lot of energy into. Rather, I think we'll see the important infrastructure emerge from places like Cern and the Large Hadron Collider people or the Internet 2 effort or some serious (gag) corporate R&D. Our little virtual village grows incrementally with cool ideas like del.icio.us, but our virtual world is built on a well planned framework brought to us by big capital... the foundation pieces with huge price tags include things like Sprint's TAT 14 Cable System, Dense Wavelength Division Multiplexing, all that hoot.

I look around this office and see maybe 300 books. A couple of shelves are full of binders that each contain multiple datasets, but by and large the books are single titles and depending on where my work takes me I may need to draw on three or four of those titles to assemble info on a variety of subjects. My only clue as to what's on these shelves is that I thought when I shelved the book that it was one I'd like to have near me. My desks and bookshelves prove that everything is miscellaneous. I'm eager to read your book about that David.

Posted by: fp | March 18, 2005 09:50 PM


Frank, I agree with Clay to a distressing degree. His talk on the topic at eTech was brilliant, IMO, and I even discovered that(through the power of selective forgetting) I lifted the Dewey example I've been using from a talk I heard him give a couple of years ago.

Tagging would be happening without the happy "folksonomy" term, imo, because some apps give us good individual reasons to tag, which then turns out to have quite spectacular social effects. And I disagree that the tagging app providers aren't thinking about the hard issues that arise when you have millions of people using short strings to tag millions of objects. They are, which holds out hope that the solutions will be nicely tied to actual apps as opposed to being imposed context-free in a Dewey-like manner. Plus, we'll figure out a bunch of stuff bottom-up, just as in your photomatt example.

I think.

Posted by: David Weinberger | March 19, 2005 07:11 AM


I was curious if you've encountered the work of Gilles Deleuze. He's a very French philosopher, died in the 80s, but he's pretty huge over there. I focus on him in studies.
His most famous book, co-authored with Felix Guattari, is called A Thousand Plateaus. The first chapter is called "Introduction: The Rhizome." They contrast the mode of organizing beings and ideas in a "tree" form (based around, say, Platonism or abstract ideals tightly ordered by rules and codes) with organizating according to a "rhizome" form (forming horizontal bonds across very different beings that allows them to create new beings/concepts).
Your argument about organizing knowledge as trees sounds very similar to the implications they're getting at, and it made me wonder if there are any connections between your work and theirs. If not, you might like reading them. Like I said, very French, but ultimately very rewarding.

Manuel Delanda is a very good commentator on their work with several essays online. This one is very good:
"Meshworks, Hierarchies, and Interfaces"
http://t0.or.at/delanda/meshwork.htm

Posted by: donald | March 19, 2005 04:30 PM


"Ask why tagging is happening now?" ... Well this might be the first time that we have tools where tagging is not just lonely and tedious. The first time we can point to something and say *in a social manner* this on one of those. Our tools are finally working the way we learn language.

Posted by: Bozo Faust | March 19, 2005 06:20 PM


I know that Pattie Maes of MIT labs is doing some research with 'coRelate Networks' /Social networks.
======
"CoRelate Networks has mined online social network communities to create a rich influence network of interests and subcultures. Some of the interests represented in the network are, inter alia, television and films, foods, geographies, music, sports, hobbies, activities, objects, and people. The strength of connections between interests are learned from the digestion of on the order of one hundred thousand user profiles from online social networks. For example, a person who likes X may also mention Y as an interest on her homepage. Identity and tastes emerge as patterns of intersection on a coRelate network"
=======

Dave's, post on taxonomy and tags has really sparked some my interest's back again to KBE, smart agents and Knowledge Semantics. Tagging may just be the begining of true knowledge mangement in digital form for the global digital village. I am thinking that tagging is one portion of the puzzle. Today, we have tagging. Tomorrow, we'll have interest engines. The day after, we'll have smart agents, that corelate interests and tags and derive at meaning to that indivual user.. who knows .. what the social tapestry will look like in the future.. !!

Posted by: /pd | March 21, 2005 10:39 AM


"Today, we have tagging. Tomorrow, we'll have interest engines. The day after, we'll have smart agents, that corelate interests and tags and derive at meaning to that indivual user.. who knows .. what the social tapestry will look like in the future.. !!"

Well said, Peter. I'm looking forward to the day when we don't need to go through the rigamarole of making a group to discuss a topic, nor find some domain in a haystack where it is being dicussed by the small group who happened to find that discussion .... rather you we can just start posting to the topic and those interested will see it and respond.

Posted by: Bozo Faust | March 21, 2005 11:19 AM


David this is a very interesting thread. Organizing information into trees looks good and works reasonably well to start, but falls down quickly when the tree gets reasonably large. You can spend more time wandering around the tree than is good for you.

Better techniques are indeed needed to organinze, manage and locate information. I've recently had someone comment to me that adding keywords to content and then searching based solely on those keywords enables him to find any information he is after far quicker than other methods. I have yet to try this myself, but instinctively it sounds good and I'm looking forward giving it a go in Surfulater.

Posted by: Neville Franks | March 31, 2005 11:37 PM


Post a comment

Guidelines for Commenting

Basically, you can say what you want. (Click here for the fine print.)

If you haven't left a comment here before, your comment may be put into a queue for me to approve. Sorry for the delay. Blame the damn spammers.