<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Joho the Blog &#187; everything_is_miscellaneous</title>
	<atom:link href="http://www.hyperorg.com/blogger/tag/everything_is_miscellaneous/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.hyperorg.com/blogger</link>
	<description>Let's just see what happens</description>
	<lastBuildDate>Sat, 15 Jun 2013 21:48:22 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Shirky&#8217;s myth of complexity</title>
		<link>http://www.hyperorg.com/blogger/2010/04/05/shirkys-myth-of-complexity/</link>
		<comments>http://www.hyperorg.com/blogger/2010/04/05/shirkys-myth-of-complexity/#comments</comments>
		<pubDate>Mon, 05 Apr 2010 12:10:21 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[business]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[shirky]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/?p=9282</guid>
		<description><![CDATA[Clay Shirky has given us a surprising number of Internet myths. And by this I mean not falsehoods but the opposite: Broad, illuminating ways of making sense of what&#8217;s going on. For example, Clay&#8217;s post about the power law distribution of links in the blogosphere (based on research by Cameron Marlow) changed how we view [...]]]></description>
				<content:encoded><![CDATA[<p>Clay Shirky has given us a surprising number of Internet myths. And by this I mean not falsehoods but the opposite: Broad, illuminating ways of making sense of what&#8217;s going on. For example, <a href="http://www.shirky.com/writings/powerlaw_weblog.html">Clay&#8217;s post about the power law distribution</a> of links in the blogosphere (based on <a href="http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.129.6561&#038;rep=rep1&#038;type=pdf">research by Cameron Marlow</a>) changed how we view authority, fame, and success in the Web ecosystem, and provided the structure within which Chris Anderson could point to the <a href="http://en.wikipedia.org/wiki/Long_Tail">Long Tail</a>. And Clay&#8217;s <a href="http://www.shirky.com/writings/ontology_overrated.html">Ontology Is Overrated</a> made clear that a change in how we categorize our world affects very real power relationships; that essay was highly influential, including on my own <a href="http://www.everythingismiscellaneous.com">Everything Is Miscellaneous</a>. </p>
<p>Clay&#8217;s new post â€” <a href="http://www.shirky.com/weblog/2010/04/the-collapse-of-complex-business-models/">The Collapse of Complex Business Models</a> â€” gives us a broad way of understanding why those who used to provide us with content will not be the ones who give us content in the future&#8230;and why they cannot fathom why not.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2010/04/05/shirkys-myth-of-complexity/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Google Books metadata: Google responds</title>
		<link>http://www.hyperorg.com/blogger/2009/09/08/google-books-metadata-google-responds/</link>
		<comments>http://www.hyperorg.com/blogger/2009/09/08/google-books-metadata-google-responds/#comments</comments>
		<pubDate>Tue, 08 Sep 2009 12:18:23 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[books]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[google_books]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/09/08/google-books-metadata-google-responds/</guid>
		<description><![CDATA[There&#8217;s a terrific colloquy between Google and Geoff Nunberg in response to Geoff&#8217;s critique of Google&#8217;s handling of the metadata attached to the books Google is digitizing (which I blogged about here). It&#8217;s fascinating for its content, but also very cool as a conversation between a company and its market. Of course, it would have [...]]]></description>
				<content:encoded><![CDATA[<p>There&#8217;s a terrific <a href="http://languagelog.ldc.upenn.edu/nll/?p=1701#comment-41758">colloquy</a> between Google and Geoff Nunberg in response to Geoff&#8217;s critique of  Google&#8217;s handling of the metadata attached to the books Google is digitizing (which I blogged about <a href="http://www.hyperorg.com/blogger/2009/09/04/google-books-metadata-meta-wreck/">here</a>).  It&#8217;s fascinating for its content, but also very cool as a conversation between a company and its market. Of course, it would have been even better if Google had initiated this conversation when it started its digitization project.</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/metadata" rel="tag">metadata</a> <a href="http://www.technorati.com/tag/books" rel="tag">books</a> <a href="http://www.technorati.com/tag/google books" rel="tag">google_books</a> <a href="http://www.technorati.com/tag/libraries" rel="tag">libraries</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/09/08/google-books-metadata-google-responds/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Data and metadata: Together again</title>
		<link>http://www.hyperorg.com/blogger/2009/09/06/data-and-metadata-together-again/</link>
		<comments>http://www.hyperorg.com/blogger/2009/09/06/data-and-metadata-together-again/#comments</comments>
		<pubDate>Sun, 06 Sep 2009 14:00:10 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[databases]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[fluiddb]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[philosophy]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/?p=8586</guid>
		<description><![CDATA[Terry Jones has an excellent post that lists the problems introduced by maintaining a hard distinction between metadata and data. Terry cites Everything Is Miscellaneous (thanks, Terry), which argues that the distinction, which is hard-coded in the Age of Databases, becomes a merely functional difference in the Age of Messy Links: Metadata is what you [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://blogs.fluidinfo.com/fluidDB/2009/09/05/metadata-vs-data-a-wholly-artificial-distinction/">Terry Jones</a> has an excellent post that lists the problems introduced by maintaining a  hard distinction between metadata and data. </p>
<p>
Terry cites <a href="http://www.EverythingIsMiscellaneous.com">Everything Is Miscellaneous</a> (thanks, Terry), which argues that the distinction, which is hard-coded in the Age of Databases, becomes a merely functional difference in the Age of Messy Links: Metadata is what you know and data is what you&#8217;re looking for. For example, the year of a CD is metadata about the CD if you know the year a Bob Dylan CD came out but you don&#8217;t remember the title, and the title can be metadata if you know the title but want to find the year. And in both cases, it could all be metadata in your search for  lyrics.  </p>
<p>
This is all very squishy and messy because the distinction is, as Terry says, artificial. It comes from thinking about experience as content that gets processed, as if we worked the way computers do. More exactly, it comes from thinking about experience as a set of  Experience Atoms that then have to be assembled; metadata are the labels that tell you that Atom A goes into Atom Z. But experience is far more like language than like particle physics or Ikea assembly instructions. And that&#8217;s for a very good reason: linguistic creatures&#8217; experience cannot be understood apart from language. Language doesn&#8217;t neatly separate into content and meta-content. It all comes together and it&#8217;s all intertwingled. Language is so very non-atomic that it makes atoms realize how lonely they&#8217;ve been.</p>
<p>
That doesn&#8217;t mean that computer software that separates metadata from data is useless. Lord knows I love a good database. But it also means that computer software that can treat anything as metadata depending on what we&#8217;re trying to do opens up some interesting possibilities&#8230;</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> <a href="http://www.technorati.com/tag/fluiddb" rel="tag">fluiddb</a> <a href="http://www.technorati.com/tag/metadata" rel="tag">metadata</a> <a href="http://www.technorati.com/tag/databases" rel="tag">databases</a> <a href="http://www.technorati.com/tag/language" rel="tag">language</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/09/06/data-and-metadata-together-again/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Evolution of Evolution</title>
		<link>http://www.hyperorg.com/blogger/2009/09/06/evolution-of-evolution/</link>
		<comments>http://www.hyperorg.com/blogger/2009/09/06/evolution-of-evolution/#comments</comments>
		<pubDate>Sun, 06 Sep 2009 13:00:27 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[darwin]]></category>
		<category><![CDATA[drafts]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[evolution]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[transparency]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/09/06/evolution-of-evolution/</guid>
		<description><![CDATA[Ben Fry posts an amazing visualization of the changes in the six editions of Darwin&#8217;s Origin of Species, based on meticulous work done by Dr. John van Wyhe and others. From Ben&#8217;s introductory text: The second edition, for instance, adds a notable &#8220;by the Creator&#8221; to the closing paragraph, giving greater attribution to a higher [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://benfry.com/traces/">Ben Fry posts an amazing visualization</a> of the changes in the six editions of Darwin&#8217;s Origin of Species, based on meticulous work done by Dr. John van Wyhe and others. From Ben&#8217;s introductory text:</p>
</p>
<blockquote><p>The second edition, for instance, adds a notable &#8220;by the Creator&#8221; to the closing paragraph, giving greater attribution to a higher power. In another example, the phrase &#8220;survival of the fittest&#8221; â€” usually considered central to the theory and often attributed to Darwin â€” instead came from British philosopher Herbert Spencer, and didn&#8217;t appear until the fifth edition of the text.</p>
</blockquote>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/darwin" rel="tag">darwin</a> <a href="http://www.technorati.com/tag/evolution" rel="tag">evolution</a> <a href="http://www.technorati.com/tag/drafts" rel="tag">drafts</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> <a href="http://www.technorati.com/tag/transparency" rel="tag">transparency</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/09/06/evolution-of-evolution/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The price of free law</title>
		<link>http://www.hyperorg.com/blogger/2009/09/04/the-price-of-free-law/</link>
		<comments>http://www.hyperorg.com/blogger/2009/09/04/the-price-of-free-law/#comments</comments>
		<pubDate>Fri, 04 Sep 2009 20:41:48 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[copyleft]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[digital rights]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[law]]></category>
		<category><![CDATA[pacer]]></category>
		<category><![CDATA[public_domain]]></category>
		<category><![CDATA[recap]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/09/04/the-price-of-free-law/</guid>
		<description><![CDATA[The latest Radio Berkman episode has me interviewing Steve Schultze about his RECAP project that posts public domain legal records that otherwise you&#8217;d have to pay to access. And the federal courts are not all that happy about it. [Tags: law public_domain pacer recap copyright copyleft everything_is_miscellaneous ]]]></description>
				<content:encoded><![CDATA[<p>The <a href="http://blogs.law.harvard.edu/mediaberkman/2009/09/04/radio-berkman-129-i-bought-the-law/">latest Radio Berkman episode</a> has me interviewing Steve Schultze about his <a href="http://www.hyperorg.com/blogger/2009/08/18/recapthelaw-org/">RECAP project</a> that posts public domain legal records that otherwise you&#8217;d have to pay to access. And the federal courts are not all that happy about it.</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/law" rel="tag">law</a> <a href="http://www.technorati.com/tag/public domain" rel="tag">public_domain</a> <a href="http://www.technorati.com/tag/pacer" rel="tag">pacer</a> <a href="http://www.technorati.com/tag/recap" rel="tag">recap</a> <a href="http://www.technorati.com/tag/copyright" rel="tag">copyright</a> <a href="http://www.technorati.com/tag/copyleft" rel="tag">copyleft</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/09/04/the-price-of-free-law/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Books metadata meta-wreck</title>
		<link>http://www.hyperorg.com/blogger/2009/09/04/google-books-metadata-meta-wreck/</link>
		<comments>http://www.hyperorg.com/blogger/2009/09/04/google-books-metadata-meta-wreck/#comments</comments>
		<pubDate>Fri, 04 Sep 2009 13:16:01 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[google_books]]></category>
		<category><![CDATA[libraries]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[worldcat]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/09/04/google-books-metadata-meta-wreck/</guid>
		<description><![CDATA[Geoff Nunberg has a fantastic post warning about the poor quality of the metadata attached to the books Google is scanning into its soon to be dominant-to-the-point-of-monopoly digital library. Apparently, the attempt to gather metadata automatically from the scans has resulted in the introduction of legions of errors. But the real problems are, as Geoff [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://languagelog.ldc.upenn.edu/nll/?p=1701">Geoff Nunberg has a fantastic post</a> warning about the poor quality of the metadata attached to the books Google is scanning into its soon to be <a id="aptureLink_widvy3DJc3" href="http://en.wikipedia.org/wiki/Google%20Book%20Search%20Settlement%20Agreement">dominant-to-the-point-of-monopoly</a> digital library. Apparently, the attempt to gather metadata automatically from the scans has resulted in the introduction of legions of errors. But the real problems are, as Geoff points out, that Google seems not to have a plan for dealing with this problem and that it has not opened up the metadata design process.</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/google books" rel="tag">google_books</a> <a href="http://www.technorati.com/tag/libraries" rel="tag">libraries</a> <a href="http://www.technorati.com/tag/metadata" rel="tag">metadata</a> <a href="http://www.technorati.com/tag/worldcat" rel="tag">worldcat</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/09/04/google-books-metadata-meta-wreck/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Copyright&#8217;s creative disincentive</title>
		<link>http://www.hyperorg.com/blogger/2009/08/31/copyrights-creative-disincentive/</link>
		<comments>http://www.hyperorg.com/blogger/2009/08/31/copyrights-creative-disincentive/#comments</comments>
		<pubDate>Mon, 31 Aug 2009 18:31:38 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[canada]]></category>
		<category><![CDATA[copyleft]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[digital rights]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[policy]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/08/31/copyrights-creative-disincentive/</guid>
		<description><![CDATA[Tucows is participating in the Canadian copyright consultation process. Rather than submitting a comment written in the usual lawyerly prose, Elliot Noss, Tucow&#8217;s CEO, asked me to write up something about copyright in my usual imprecise and incoherent prose. I like Elliot a lot, and I care about copyright, so I wrote about the argument [...]]]></description>
				<content:encoded><![CDATA[<p> <a href="http://www.tucowsinc.com">Tucows</a> is participating in the  <a href="http://copyright.econsultation.ca/">Canadian copyright consultation</a> process. Rather than submitting a comment written in the usual lawyerly prose, Elliot Noss, Tucow&#8217;s CEO, asked me to write up something about copyright in my usual  imprecise and incoherent prose.  I like Elliot a lot, and I care about copyright, so I wrote about the argument that without strong copyright protection, creators won&#8217;t have an incentive to create.  The piece is now <a href="http://tucowsinc.com/news/2009/08/copyrights-creative-disincentive/">posted</a>&#8230; [The next day: I absolutely should have mentioned that this was a commissioned piece. I.e., Elliot paid me to write something, and posted it unaltered.]</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/copyleft" rel="tag">copyleft</a> <a href="http://www.technorati.com/tag/copyright" rel="tag">copyright</a> <a href="http://www.technorati.com/tag/culture" rel="tag">culture</a> <a href="http://www.technorati.com/tag/canada" rel="tag">canada</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/08/31/copyrights-creative-disincentive/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Encyclopedia of Life &#8211; Now by Humans!</title>
		<link>http://www.hyperorg.com/blogger/2009/08/26/encyclopedia-of-life-now-by-humans/</link>
		<comments>http://www.hyperorg.com/blogger/2009/08/26/encyclopedia-of-life-now-by-humans/#comments</comments>
		<pubDate>Wed, 26 Aug 2009 14:56:51 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[biology]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[taxonomy]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/08/26/encyclopedia-of-life-now-by-humans/</guid>
		<description><![CDATA[The Encyclopedia of Life is encouraging citizen contributions to its experts-vetted pages, so far with what seem like excellent results. There&#8217;s a good article about this at Science Daily. After two years, they&#8217;ve got 150,000 species pages underway, with 1.4 million stubs awaiting drafting. [Tags: crowdsourcing everything_is_miscellaneous science biology taxonomy ]]]></description>
				<content:encoded><![CDATA[<p>The <a href="http://www.EOL.org">Encyclopedia of Life</a>  is encouraging citizen contributions to its experts-vetted pages, so far with what seem like excellent results. There&#8217;s a good article about this at <a href="http://www.sciencedaily.com/releases/2009/08/090824210040.htm">Science Daily</a>. After two years, they&#8217;ve got 150,000 species pages underway, with 1.4 million stubs awaiting drafting.</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/crowdsourcing" rel="tag">crowdsourcing</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> <a href="http://www.technorati.com/tag/science" rel="tag">science</a> <a href="http://www.technorati.com/tag/biology" rel="tag">biology</a> <a href="http://www.technorati.com/tag/taxonomy" rel="tag">taxonomy</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/08/26/encyclopedia-of-life-now-by-humans/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Dilbert goes miscellaneous</title>
		<link>http://www.hyperorg.com/blogger/2009/08/19/dilbert-goes-miscellaneous/</link>
		<comments>http://www.hyperorg.com/blogger/2009/08/19/dilbert-goes-miscellaneous/#comments</comments>
		<pubDate>Wed, 19 Aug 2009 11:50:10 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[comics]]></category>
		<category><![CDATA[dilbert]]></category>
		<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[humor]]></category>
		<category><![CDATA[taxonomy]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/08/19/dilbert-goes-miscellaneous/</guid>
		<description><![CDATA[Amusing Dilbert today, for those who can&#8217;t resist a good taxonomy joke. (Thanks for the tip, Helena!) [Tags: everything_is_miscellaneous comics dilbert humor taxonomy ]]]></description>
				<content:encoded><![CDATA[<p>Amusing <a href="http://dilbert.com/strips/comic/2009-08-19/">Dilbert</a> today, for those who can&#8217;t resist a good taxonomy joke. (Thanks for the tip, Helena!)</p>
<p><span id="tagspan" class='tags'>[Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> <a href="http://www.technorati.com/tag/comics" rel="tag">comics</a> <a href="http://www.technorati.com/tag/dilbert" rel="tag">dilbert</a> <a href="http://www.technorati.com/tag/humor" rel="tag">humor</a> <a href="http://www.technorati.com/tag/taxonomy" rel="tag">taxonomy</a> ]</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/08/19/dilbert-goes-miscellaneous/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Search Pidgin</title>
		<link>http://www.hyperorg.com/blogger/2009/08/14/search-pidgin/</link>
		<comments>http://www.hyperorg.com/blogger/2009/08/14/search-pidgin/#comments</comments>
		<pubDate>Fri, 14 Aug 2009 20:01:05 +0000</pubDate>
		<dc:creator>davidw</dc:creator>
				<category><![CDATA[everythingIsMiscellaneous]]></category>
		<category><![CDATA[everything_is_miscellaneous]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[natural_language_processing]]></category>
		<category><![CDATA[nlp]]></category>
		<category><![CDATA[pidgin]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.hyperorg.com/blogger/2009/08/14/search-pidgin/</guid>
		<description><![CDATA[I know I&#8217;m not the only one who&#8217;s finding WolframAlpha sometimes frustrating because I can&#8217;t figure out the magic words to use to invoke the genii. To give just one example, I can&#8217;t figure out how to see the frequency of the surnames Kumar and Weinberger compared side-by-side in WolframAlpha&#8217;s signature fashion. It&#8217;s a small [...]]]></description>
				<content:encoded><![CDATA[<p>I know I&#8217;m not the only one who&#8217;s finding <a href="http://www.wolframalpha.com">WolframAlpha</a> sometimes frustrating because I can&#8217;t figure out the magic words to use to invoke the genii. To give just one example, I can&#8217;t figure out how to see the frequency of the surnames  Kumar and Weinberger compared side-by-side in WolframAlpha&#8217;s signature fashion. It&#8217;s a small thing because &#8220;surname Kumar&#8221; and &#8220;surname Weinberger&#8221; will get you info about each individually. But over and over, I fail to guess the way WolframAlpha wants me to phrase the question. </p>
<p>Search engines are easier because they have already trained us how to talk to them. We know that we generally get the same results whether we use the stop words &#8220;when,&#8221; &#8220;the,&#8221; etc. and questions marks or not. We eventually learn that quoting a phrase searches for exactly that phrase. We may even learn that in many engines, putting a dash in front of a word excludes pages containing it from the results, or that we can do marvelous and magical things with prefaces that end in a colon site:, define:. We also learn the semantics of searching: If you want to find out the name of that guy who&#8217;s Ishmael&#8217;s friend in Moby-Dick, you&#8217;ll do best to include some words likely to be on the same page, so &#8220;&#8216;What was the name of that guy in Moby-Dick who was the hero&#8217;s friend?&#8217;&#8221; is way worse than &#8220;Moby-Dick harpoonist&#8217;.&#8221; I have no idea what the curve of query sophistication looks like, but most of us have been trained to one degree or another by the search engines who are our masters and our betters. </p>
<p>In short, we&#8217;re being taught a <a href="http://en.wikipedia.org/wiki/Pidgin">pidgin</a> language â€”  a simplified language for communicating across cultures. In this case, the two cultures are human and computers. I only wish the pidgin were more uniform and useful. Google has enough dominance in the market that its syntax influences other search engines. Good! But we could use some help taking the next step, formulating more complex natural language queries in a pidgin that crosses application boundaries, and that isn&#8217;t designed for standard database queries.</p>
<p>Or does this already exist?</p>
<p><span id="tagspan" class='tags'>Tags: <a href="http://www.technorati.com/tag/berkman" rel="tag"></a> <a href="http://www.technorati.com/tag/search" rel="tag">search</a> <a href="http://www.technorati.com/tag/pidgin" rel="tag">pidgin</a> <a href="http://www.technorati.com/tag/nlp" rel="tag">nlp</a> <a href="http://www.technorati.com/tag/natural language processing" rel="tag">natural_language_processing</a> <a href="http://www.technorati.com/tag/google" rel="tag">google</a> <a href="http://www.technorati.com/tag/everything is miscellaneous" rel="tag">everything_is_miscellaneous</a> </span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.hyperorg.com/blogger/2009/08/14/search-pidgin/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.421 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2013-06-16 16:33:54 -->