<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>C L A Y F O X &#187; photo-sharing site</title>
	<atom:link href="http://www.clayfox.com/tag/photo-sharing-site/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.clayfox.com</link>
	<description></description>
	<lastBuildDate>Sun, 22 Jan 2012 17:31:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Life in the taggregate</title>
		<link>http://www.clayfox.com/2007/11/23/life-in-the-taggregate/</link>
		<comments>http://www.clayfox.com/2007/11/23/life-in-the-taggregate/#comments</comments>
		<pubDate>Fri, 23 Nov 2007 15:33:09 +0000</pubDate>
		<dc:creator>Mark Phillipson</dc:creator>
				<category><![CDATA[Libraryworld]]></category>
		<category><![CDATA[Tagging]]></category>
		<category><![CDATA[^]]></category>
		<category><![CDATA[aggregate processing]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[collective intelligence systems]]></category>
		<category><![CDATA[e-science]]></category>
		<category><![CDATA[France]]></category>
		<category><![CDATA[Jonah Bossewitch]]></category>
		<category><![CDATA[Paris France]]></category>
		<category><![CDATA[photo-sharing site]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[social software]]></category>
		<category><![CDATA[TagMaps]]></category>
		<category><![CDATA[Texas]]></category>
		<category><![CDATA[Texas United States]]></category>
		<category><![CDATA[Tim Berners-Lee]]></category>
		<category><![CDATA[Tom Gruber]]></category>
		<category><![CDATA[Yahoo!]]></category>
		<category><![CDATA[Yahoo! Labs]]></category>

		<guid isPermaLink="false">http://www.clayfox.com/2007/11/23/life-in-the-taggregate/</guid>
		<description><![CDATA[From its earliest days, the promise of the Semantic Web has been to bring networked computers closer to the forms and priorities of human inquiry. This promise depends on mark-up language that gives data some structure, and frameworks that bring such structure into recognizable relationships. As a May 2001 Scientific American piece by Tim Berners-Lee [...]]]></description>
			<content:encoded><![CDATA[<p>From its earliest days, the promise of the <a href="http://en.wikipedia.org/wiki/Semantic_web">Semantic Web</a> has been to bring networked computers closer to the forms and priorities of human inquiry.  This promise depends on mark-up language that gives data some structure, and frameworks that bring such structure into recognizable relationships.   As a <a href="http://www.sciam.com/article.cfm?articleID=00048144-10D2-1C70-84A9809EC588EF21">May 2001 Scientific American piece by Tim Berners-Lee and colleagues</a> put it, &#8220;for the semantic web to function, computers must have access to structured collections of information and sets of inference rules that they can use to conduct automated reasoning.&#8221;  </p>
<p>Automated reasoning!  This dream may be <a href="http://www.google.com/url?sa=t&#038;ct=res&#038;cd=1&#038;url=http%3A%2F%2Feprints.ecs.soton.ac.uk%2F12614%2F01%2FSemantic_Web_Revisted.pdf&#038;ei=_epGR-XJJJLagQLKkKmDDQ&#038;usg=AFQjCNEY4uyef3VSWbBKWK3zzbWM7vnNEg&#038;sig2=yYSMS7vHC0A0w4fb7S4iew">coming to life in e-science</a>, with its highly structured and interoperable datasets, but in many other contexts the idea of a Semantic Web sits uneasily with the younger and more popular kid on the block, the Participatory Web.  Web 2.0 environments amasses a lot of data and, more importantly, a lot of information about this data generated by humans downright impervious to the need of machines for identifiable and consistent structure.  Such tags are generally free-form, non-hierarchical, not expressing relationships in a predictable and consistent way; they dance to &#8220;folksonomy&#8221; not &#8220;taxonomy&#8221;; they are blithely untethered to &#8220;<a href="http://en.wikipedia.org/wiki/Ontology_(computer_science)">ontologies</a>,&#8221; to any URI-based language standards.   </p>
<p>Nevertheless there is intriguing thought out there about the potential <a href="http://iswc2006.semanticweb.org/program/webpanel.php">interplay of the Semantic Web and Web 2.0</a>.  The <a href="http://tagcommons.org/">Tagcommons</a> sites lays out Use Cases that envision sharing tags across databases, and sketches out some functional requirements to make that interoperability happen.  Tom Gruber, in particular, has argued energetically for &#8220;<a href="http://tomgruber.org/writing/CollectiveKnowledgeSystems.htm">collective intelligence systems</a>&#8221;  built from syntheses  of structured data and social software; his travel-review site <a href="http://realtravel.com/">RealTravel</a> uses a &#8220;snap-to-grid&#8221; model to disambiguate and structure user-supplied tags.  </p>
<p>And now in <a href="http://www.yahooresearchberkeley.com/">Yahoo! Research Berkeley labs</a>, algorithms are starting to take into account aggregate patterns in order to sift out meaning from vast oceans of community-generated tags despite all their unstructured messiness &#8212; or, as computer scientists like to say, despite all their &#8220;noise.&#8221;  It&#8217;s a matter of inference and cluster analysis.  Case in point:  the photo-sharing site <a href="http://www.flickr.com/">Flickr</a>&#8216;s new experiments in extracting &#8220;practical information about the world&#8221; from the snapshots and tags poured into it by the great unwashed.  The report &#8220;<a href="http://portal.acm.org/citation.cfm?id=1291384&#038;coll=portal&#038;dl=ACM&#038;CFID=222830&#038;CFTOKEN=20286026">How flickr helps us make sense of the world: context and content in community-contributed media collections</a>,&#8221;  describes a layered process of tag and image analysis&#8211;one that can be conducted entirely by machines&#8211;that identifies representational tags as well as place and event semantics.  </p>
<p>What does all this do for us?  For one thing, it can improve a search through piles of community-contributed materials; my search for &#8220;Harlem&#8221; stands a better chance of coming up with the most representative picture of the neighborhood, or a set of iteratively varied views of the neighborhood, or even a conglomeration of views for a composite view.  I could determine the most visited place in the neighborhood, or the scenes of important events.  Yahoo!&#8217;s researchers are even thinking about automatic tagging of photos, or suggestions for tags, that are generated by visual content abetted by contextual and geographical cues.  </p>
<p>Here are a couple of spins of Yahoo! Labs&#8217; <a href="http://tagmaps.research.yahoo.com/">TagMaps</a>:</p>
<p><img src='http://www.clayfox.com/blog/wp-content/uploads/2007/11/flickrmap.jpg' alt='Flickr World Browser Harlem' /></p>
<p><em>^ TagMap&#8217;s World Browser analyzes Flickr tags to locate &#8220;Harlem&#8221; on a map and offer a set of representative photos (on the right).  Harlem seems pushed to the west, and the chicken picture is a little odd, but this machine-generated guess seems viable enough.<br />
</em></p>
<p><img src='http://www.clayfox.com/blog/wp-content/uploads/2007/11/flickrmapparis.jpg' alt='TagMap World Browser Paris' /><br />
<em><br />
^ A search for &#8216;Paris&#8217; in TagMap&#8217;s World Browser whisks us to a city in the middle of France, not Texas, and avoids any pictures of over-photographed heiresses. See:  machines have taste too.</em></p>
<p>Teasing meaning out of cacophony, evaluating &#8216;where what &#038; when&#8217; through dumb processing of inconsistent human traces: it&#8217;s not hard to sense an artificial intelligence awakening here with its own priorities, despite the human decision (conscious or not) to ignore machine-oriented information conventions.  What is the ultimate effect of algorithms trained to crunch through the idiosyncratic and identify the representational?  Could such aggregate processing of unstructured data fuel a general regression to the mean, as <a href="http://alchemicalmusings.org/2007/11/13/crowded-wisdom/">alchemist Jonah Bossewitch muses</a>?   As a Trekkie (<a href="http://en.wikipedia.org/wiki/Trekkie#Trekkie_v._Trekker">or is it Trekker?</a>) might say, streaming into yet another convention, <a href="http://en.wikipedia.org/wiki/Resistance_is_futile">resistance is futile</a>.</p>
<p>The fear of human conglomeration coming into sudden sentience is nothing new, of course.  I just re-read <a href="http://en.wikipedia.org/wiki/Frankenstein">Frankenstein</a> with a set of fresh young readers, and alarmist correlations of that good old story to a improbably persistent, flexible, and collective-mashed form of AI doubtlessly come too easily to me now.  But I do sometimes wonder whether we too will wake up from our most logocentric tagging idylls to sense senseless and unblinking eyes, watching us in the dark and hungry for more.  </p>
]]></content:encoded>
			<wfw:commentRss>http://www.clayfox.com/2007/11/23/life-in-the-taggregate/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

