<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"
>

<channel>
	<title>Hal Roberts &#187; google</title>
	<atom:link href="http://blogs.law.harvard.edu/hroberts/category/google/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.law.harvard.edu/hroberts</link>
	<description>watching technology</description>
	<lastBuildDate>Wed, 17 Jun 2009 19:27:14 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<creativeCommons:license>http://creativecommons.org/licenses/by-sa/3.0/</creativeCommons:license>
		<item>
		<title>Digital Cameras v. Nigeria</title>
		<link>http://blogs.law.harvard.edu/hroberts/2008/08/12/digital-cameras-v-nigeria/</link>
		<comments>http://blogs.law.harvard.edu/hroberts/2008/08/12/digital-cameras-v-nigeria/#comments</comments>
		<pubDate>Tue, 12 Aug 2008 17:32:46 +0000</pubDate>
		<dc:creator>hal</dc:creator>
				<category><![CDATA[ads]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://blogs.law.harvard.edu/hroberts/2008/08/12/digital-cameras-v-nigeria/</guid>
		<description><![CDATA[
One of my guiding theories of the modern media / advertising landscape is that the extensive real time surveillance of consumers by online advertisers and content providers encourages the growth of content about digital cameras (the content about which is easily monetized) at the expense of hard news, especially international news about developing countries like [...]]]></description>
			<content:encoded><![CDATA[<p>
One of my guiding theories of the modern media / advertising landscape is that the extensive real time surveillance of consumers by online advertisers and content providers encourages the growth of content about digital cameras (the content about which is easily monetized) at the expense of hard news, especially international news about developing countries like Nigeria.
</p>
<p>
The following <a href="http://google.com/insights/search">google insights</a> <a href="http://www.google.com/insights/search/#cat=&amp;q=newspaper%2Cblog%2Cmagazine&amp;geo=NG&amp;date=&amp;clp=&amp;cmpt=q">chart</a> of <font color="blue">digital camera</font> v. <font color="red">Nigeria</font> searches over time strikes a blow against that theory:
</p>
<p><img src="http://blogs.law.harvard.edu/hroberts/files/2008/08/google_insight_media_nigeria1.png" /></p>
<p>
Of course, this data say nothing about the amount of content produced about the respective topics, but the whole point of the google insights tool (which is targeted at advertisers) is to tell advertisers and content providers what sorts of content consumers are interested in.  Content about digital cameras is likely still more profitable, since digital camera ad clicks presumably pay more than Nigeria ad clicks, but the decline in digital camera searches is still striking.  It&#8217;s possible that this trend is merely the result of declining interest in digital cameras (which is surprising), but the fact that searches about Nigeria have not decreased over time is interesting in itself.  Quick <a href="http://www.google.com/insights/search/#cat=&amp;q=hdtv%2Cguatamala&amp;geo=&amp;date=&amp;clp=&amp;cmpt=q">checks</a> of <a href="http://www.google.com/insights/search/#cat=&amp;q=ipod%2Cvietnam&amp;geo=&amp;date=&amp;clp=&amp;cmpt=q">similar</a> <a href="http://www.google.com/insights/search/#cat=&amp;q=banana%20republic%2Cfamine&amp;geo=&amp;date=&amp;clp=&amp;cmpt=q">comparisons</a> show that consumer product content is more popular than hard news content, but that there is no accelerating trend in that direction.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.law.harvard.edu/hroberts/2008/08/12/digital-cameras-v-nigeria/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by-sa/3.0/</creativeCommons:license>
	</item>
		<item>
		<title>Google Insights: Newspaper v. Blog v.Magazine</title>
		<link>http://blogs.law.harvard.edu/hroberts/2008/08/12/google-insights-newspaper-v-blog-vmagazine/</link>
		<comments>http://blogs.law.harvard.edu/hroberts/2008/08/12/google-insights-newspaper-v-blog-vmagazine/#comments</comments>
		<pubDate>Tue, 12 Aug 2008 17:08:53 +0000</pubDate>
		<dc:creator>hal</dc:creator>
				<category><![CDATA[ads]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://blogs.law.harvard.edu/hroberts/2008/08/12/google-insights-newspaper-v-blog-vmagazine/</guid>
		<description><![CDATA[
I&#8217;ve been playing around with the new Google Insights for Search, which is targeted to advertisers but is terribly interesting for anyone interested in media issues.  Here&#8217;s a comparison of searches for newspaper, blog, and magazine:
Worldwide


U.S.

Nigeria


Leaving aside the obvious qualifications about the limitations of this metric, the fact that blogs have become more popular [...]]]></description>
			<content:encoded><![CDATA[<p>
I&#8217;ve been playing around with the new <a href="http://www.google.com/insights/search">Google Insights for Search</a>, which is targeted to advertisers but is terribly interesting for anyone interested in media issues.  Here&#8217;s a comparison of searches for <font color="blue">newspaper</font>, <font color="red">blog</font>, and <font color="orange">magazine</font>:</p>
<p><b><a href="http://www.google.com/insights/search/#cat=&amp;q=newspaper%2Cblog%2Cmagazine&amp;geo=&amp;date=&amp;clp=&amp;cmpt=q">Worldwide</a></b></p>
</p>
<p><img src="http://blogs.law.harvard.edu/hroberts/files/2008/08/google_insight_media.png" /></p>
<p><b><a href="http://www.google.com/insights/search/#cat=&amp;q=newspaper%2Cblog%2Cmagazine&amp;geo=US&amp;date=&amp;clp=&amp;cmpt=q">U.S.</a></b></p>
<p><img src="http://blogs.law.harvard.edu/hroberts/files/2008/08/google_insight_media_us.png" /></p>
<p><b><a href="http://www.google.com/insights/search/#cat=&amp;q=newspaper%2Cblog%2Cmagazine&amp;geo=NG&amp;date=&amp;clp=&amp;cmpt=q">Nigeria</a></b></p>
<p><img src="http://blogs.law.harvard.edu/hroberts/files/2008/08/google_insight_media_nigeria.png" /></p>
<p>
Leaving aside the obvious qualifications about the limitations of this metric, the fact that blogs have become more popular in relation to newspapers is obvious and only interesting to see it visualized so clearly.  But the slower rise of blogs in the U.S. vs. worldwide is not obvious to me, nor is the vastly higher (and growing) popularity of newspapers in Nigeria (one might guess without any specific knowledge of Nigeria that the technically sophisticated folks online in Nigeria would be more likely to access blogs).  It&#8217;s worth following the links to see further breakdowns of the data.
</p>
<p>
Much of my time will be lost (mostly productively) fiddling with this tool.  The tool critically includes csv exports for all searches as well with terms of use that allow personal or research use, which will allow this data easily to be mashed up with <a href="http://freebase.com">other sources of data</a>.  One complaint is the lack of support for easily embedding the resulting graphs on other pages (I had to screen capture the above charts).</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.law.harvard.edu/hroberts/2008/08/12/google-insights-newspaper-v-blog-vmagazine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by-sa/3.0/</creativeCommons:license>
	</item>
		<item>
		<title>Google Adwords Category Exclusion</title>
		<link>http://blogs.law.harvard.edu/hroberts/2008/06/24/google-adwords-category-exclusion/</link>
		<comments>http://blogs.law.harvard.edu/hroberts/2008/06/24/google-adwords-category-exclusion/#comments</comments>
		<pubDate>Tue, 24 Jun 2008 18:16:19 +0000</pubDate>
		<dc:creator>hal</dc:creator>
				<category><![CDATA[ads]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://blogs.law.harvard.edu/hroberts/2008/06/24/google-adwords-category-exclusion/</guid>
		<description><![CDATA[
Google recently added category exclusion to its adwords system, allowing advertisers to choose not to support content that deals with topics such as &#8220;death &#38; tragedy&#8221; and &#8220;military &#38; international conflict&#8221;.  The new category exclusion feature allows an advertiser exclude from its content network any pages that belong to a specified set of topics [...]]]></description>
			<content:encoded><![CDATA[<p>
Google recently <a href="http://adwords.blogspot.com/2008/03/category-exclusion-for-content-network.html">added category exclusion</a> to its adwords system, allowing advertisers to choose not to support content that deals with topics such as &#8220;death &amp; tragedy&#8221; and &#8220;military &amp; international conflict&#8221;.  The new category exclusion feature allows an advertiser exclude from its content network any pages that belong to a specified set of <a href="http://adwords.google.com/support/bin/answer.py?answer=71503">topics</a> or <a href="http://adwords.google.com/support/bin/answer.py?answer=71871">page types</a>.
</p>
<p><img src="http://blogs.law.harvard.edu/hroberts/files/2008/06/adwords_exclusion_topics.png" /></p>
<p>Here are description of the exclusion topics:</p>
<blockquote>
<p>Conflict &amp; tragedy:</p>
<p>Crime, police &amp; emergency: Police blotters, news stories on fires, and emergency services resources</p>
<p>Death &amp; tragedy: Obituaries, bereavement services, accounts of natural disasters, and accidents</p>
<p>Military &amp; international Conflict: News about war, terrorism, and sensitive international relations</p>
<p />
<p>Edgy content:</p>
<p>Juvenile, gross &amp; bizarre Content: Jokes, weird pictures, and videos of stunts</p>
<p>Profanity &amp; rough language: Moderate use of profane language</p>
<p>Sexually suggestive content: Provocative pictures and text</p>
</blockquote>
<p>
The topic exclusion is the most interesting, because it directly supports an argument I&#8217;m making (in a paper I&#8217;m writing now on surveillance, google, and botnets) that google&#8217;s adwords system is a primary driver of a move away from stories about &#8220;death &amp; tragedy&#8221; in far away places and toward stories about digital cameras.  Notice that &#8220;consumer electronics&#8221; is not one of the exclusion topics.
</p>
<p>Here are the page types:</p>
<blockquote>
<p>Network types:</p>
<p>Parked domains are sites in Google&#8217;s AdSense for domains network. Users are brought to parked domain sites when they enter the URL of an undeveloped webpage into a browser&#8217;s address bar. There, they&#8217;ll see ads relevant to the terminology in the URL they entered. The AdSense for domains network is encompassed by both the content network and the search network. If you exclude this page type, you&#8217;ll exclude all parked domain sites, including the ones on the search network. Learn more.</p>
<p>Error pages are part of Google&#8217;s AdSense for errors network. Certain users are brought to error pages when they enter a search query or unregistered URL in a browser&#8217;s address bar. There, they&#8217;ll see ads relevant to the search query or URL they entered. Learn more.</p>
<p />
<p>User-generated content:</p>
<p>Forums are websites devoted to open discussion of a topic.</p>
<p>Social networks are websites offering an interactive network of friends with personal profiles.</p>
<p>Image-sharing pages allow users to upload and view images.</p>
<p>Video-sharing pages allow users to view uploaded videos. </p>
</blockquote>
<p>
These page types are interesting as well because they exert control over the presentation as well the substance of a given topic.  If the commercial interests don&#8217;t like video sharing then by gosh there will be less video sharing.
</p>
<p>
For both the page types and the topics it would be helpful for Google to provide information about how specific pages are classified.  Such knowledge could discourage content owners from publishing content that they know will trigger the &#8220;death &amp; tragedy&#8221; topic, but the lack of knowledge about how the classification works could have the even worse effect of content producers being doubly careful not to produce any content that <i>might</i> be classified as an excluded topic.  In either case, the topics are likely to have a strong effect on the kinds of content that gets published.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.law.harvard.edu/hroberts/2008/06/24/google-adwords-category-exclusion/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by-sa/3.0/</creativeCommons:license>
	</item>
		<item>
		<title>Two Spectacles</title>
		<link>http://blogs.law.harvard.edu/hroberts/2008/03/28/two-spectacles/</link>
		<comments>http://blogs.law.harvard.edu/hroberts/2008/03/28/two-spectacles/#comments</comments>
		<pubDate>Fri, 28 Mar 2008 16:21:36 +0000</pubDate>
		<dc:creator>hal</dc:creator>
				<category><![CDATA[ads]]></category>
		<category><![CDATA[bukatman]]></category>
		<category><![CDATA[burroughs]]></category>
		<category><![CDATA[foucault]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[spectacle]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://blogs.law.harvard.edu/hroberts/2008/03/28/two-spectacles/</guid>
		<description><![CDATA[I&#8217;ve been pondering how the concept of spectacle fits in with surveillance.  In particular, I&#8217;ve been bouncing around two different concepts of the spectacle, one by Michel Foucault and the other by Scott Bukatman.
Here&#8217;s an execution spectacle in Michel Foucault&#8217;s Discipline &#38; Punish:

&#8216;Finally, he was quartered,&#8217; recounts the Gazette d&#8217;Amsterdam of 1 April 1757. [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been pondering how the concept of spectacle fits in with surveillance.  In particular, I&#8217;ve been bouncing around two different concepts of the spectacle, one by Michel Foucault and the other by Scott Bukatman.</p>
<p>Here&#8217;s an execution spectacle in Michel Foucault&#8217;s <i>Discipline &amp; Punish</i>:</p>
<blockquote><p>
&#8216;Finally, he was quartered,&#8217; recounts the Gazette d&#8217;Amsterdam of 1 April 1757.  &#8216;This last operation was very long, because the horses used were not accustomed to drawing; consequently, instead of four, six were needed; and when that did not suffice, they were forced, in order to cut off the wretch&#8217;s thighs, to sever the sinews and hack at the joints&#8230; (Foucault 75)
</p></blockquote>
<p>Here&#8217;s a science fiction spectacle (William Burrough&#8217;s death dwarf in <i>Nova Express</i>) in Scott Bukatman&#8217;s <i>Terminal Identity</i>:</p>
<blockquote><p>
&#8220;Images &#8212; millions of images &#8212; That what I eat &#8212; Cyclotron shit &#8212; Ever trying kicking that habit with apormorphine? &#8212; Now I got all the images of sex acts and torture ever took place anywhere and I can just blast it out and control you gooks right down to the molecule &#8212; I got orgasms &#8212; I got screams &#8212; I got all the images any hick poet ever shit out &#8212; My power&#8217;s coming &#8212; My power&#8217;s coming &#8212; My power&#8217;s coming. &#8230; And I got millions of images of Me, Me, Me meee.&#8221; (Bukatman 45)
</p></blockquote>
<p>Foucault&#8217;s <i>Discipline &amp; Punish</i> describes an arc from disciplining society through the use of public executions as spectacle to disciplining society through the more complex interaction of an array of civilizing institutions (prisons, schools, hospitals, factories, etc) and their various agents (guards, judges, experts, teachers, etc).  Foucault argues that the spectacles were effective insofar as they represented an imposition of the king&#8217;s body onto the body of the public, and at the same time provided space for the condemned to voice their frustrations against the monarchy.  This system of disciplining the public began to break apart as the power of the people grew and consequently the king&#8217;s symbolic body lost power relative to the body of the people, which is to say that the increasingly powerful public was able to question the fairness of the terrifying executions and the one sided prosecutions that led to them.</p>
<p>What eventually replaced these spectacles, Foucault argues, was the modern set of institutions whose most important impact was to embed discipline into the social fabric itself, rather than imposing discipline bodily through bloody spectacle.  Prisoners, pupils, patients, and workers (as examples) learned that they were being watched continuously and that their fates were judged by a set of scientific criteria which were themselves defined by the system of watching.  Instead of investing all power in the prosecutor, the modern judicial system invests its power into the jury&#8217;s ability to objectively judge the truth of various pieces of witness and scientific testimony.  The guilty man is condemned not because of the will of the king but because our objective system determines him to be guilty.  If you don&#8217;t want to be judged guilty, you have to judge yourself by this objective criteria, rather than by the arbitrary decisions of the king.  And if you want to succeed in school or at work, you have to measure up to objective criteria.  But those objective criteria are themselves influenced by this process.  Experts prune themselves for their work in court, social institutions design the tests that determine school success, and so on.</p>
<p>What&#8217;s interesting to note is that Foucault described in 1975 a move away from spectacle and to this more complex and subtle integration of the scientific and the social as a means of social discipline.  In 1994, Scott Bukatman&#8217;s <i>Terminal Identity</i> captured a widely held consensus that television and other mass media had once again made spectacle the dominant mode of social discipline.  Bukatman pulls together a set of social theorists, media thinkers, and science fiction authors to describe a &#8220;terminal space&#8221; in which a flood of image, audio, and text &#8220;blips&#8221; constitute a never ending spectacle through which society defines itself.  This image of society as spectacle contrasts sharply with the image drawn just twenty year before by Foucault.  And the spectacle has multiplied itself many times with the explosive growth of the Internet since 1994, including the growth of Internet pornography and YouTube beatings that make Burrough&#8217;s death dwarf (above) look prescient.</p>
<p>To figure out what this move away from and then back to spectacle means, we first have to figure out what &#8220;spectacle&#8221; means and whether Foucault and Bukatman are referring to the same thing at all.  Foucault&#8217;s spectacle of execution and Bukatmans&#8217; (and others&#8217;) spectacle of mass media both refer to a display of  striking images.  The public execution is striking largely because it is dramatically physical.  One cannot watch a execution without a strong, physical reaction.  Bukatman&#8217;s flood of images (and other media) is striking partly because each image is designed for emotional impact (buy this SUV if you want to dominate the road) but mostly because of the sheer number of images.  It is striking to be shown many different images at once, even if each image is just as a single solid color.  Foucault&#8217;s spectacle is striking because it is so strongly physical, whereas the modern media spectacle is completely virtual &#8212; it is a spectacle because of the sheer flood of input that cannot be reproduced bodily.  Bukatman argues that the lack of physicality actually defines the media spectacle as such:  &#8220;pure spectacle &#8230; [is] &#8230; a proliferation of of semiotic systems and simulations which increasingly serve to replace physical human experience and interaction.&#8221; (Bukatman 26)  Most importantly, both of these spectacles are used a source of control over society, and the impact of each form of spectacle relies on the dramatic impact of the spectacle itself &#8212; this need for dramatic impact is why, for instance, modern executions by injection serve nothing like the role of the spectacular public executions that Foucault describes.  In contrast to the slow diffusion of the social knowledge through institutions, spectacle derives its power from its ability to reach directly into the brain of its subjects and create an immediate reaction (who wants to argue over which textbook a school should use when you can just make a YouTube video and beam your truth directly into kids&#8217; brains?).  </p>
<p>Coming back to surveillance, both of these forms of spectacle (like institutional discipline) involve not just being watched, but watching as well.  The spectacle of execution is an application of watching onto the public: not only does the execution enact the punishment of the kind on the body of the people, but the process of prosecution applies the eye of the king onto the people.  By watching and judging the condemned, the king is making clear that he is watching the public, both symbolically and through his state apparatus.  The watching and being watched of this process are necessarily entangled: one cannot have a public execution without a prosecution, and the prosecution has no social impact if it is not in turn watched.</p>
<p>The modern media spectacle provides an even more tangled relationship between watching and being watched.  Much of the modern spectacle is advertising, which is about pushing images to consumers to get them to watch them.  But advertisements are only useful if the advertiser knows who is watching them. Television ratings and commercials are necessary complements, as are clicks and web advertisements.  An advertisement (and any spectacle used as a social lever) is only useful insofar as its impact can be measured, and knowledge of who is watching an ad is necessary to measure that impact.  This tangled relationship between watching and being watched applies not only to the advertiser and the consumer, but quickly spreads out into the whole range of different actors involved.  The content provider watches which ads are most profitable and watches which content brings consumers to its ads, the ad brokers like google watch both consumers and advertisers to determine which ads are most valuable and profitable, the various participants in the botnet economy watch (or pretend to watch) the ads to game profits while also watching google to determine how to avoid its click fraud filtering, search engine optimization (SEO) agents of various levels of legitimacy watch google to improve their customers&#8217; positions in the google index, security professionals of various sorts watch the botnets to learn how to protect against them and watch users to determine if they are infect or likely to get infected, and on and on.  Every one of these actors is both watching and being watched, and each one is a necessary growth of the system that begins with the simple display of an image intended to leverage some sort of social control.  It is impossible to say which actors are merely being watched and which are watching, just as it is impossible to point to any activity that does not involve both watching and being watched by multiple actors.</p>
<p>What&#8217;s different between execution as spectacle and media as spectacle is that executions are hard to repeat, whereas each of the little images that together make up the media spectacle are easy to produce.  Today, production and distribution of these images on the Internet has become virtually free, so anyone can produce these images, and they are ominpresent in the media (online, tv, etc).  But this ease of production makes the effects of the spectacle much, much less clear.  Foucault argues that the execution as discipline ended largely because it began to grow network effects that the king could not control, and that was just from a single producer of generally infrequent spectacles.  The result of the proliferation of spectacle producers today is a hugely complex network, briefly sketched above, whose effects are mostly emergent and unpredictable.  When anyone can leverage control through a spectacle, everyone does, but those effects all bounce off of one another in complex feedback loops.  Foucault describes a similar effect in his institutional disciplinary society, but the effect in his case involves far fewer major players and is therefore much less complex.  Today anyone can create a spectacle to influence society (buy a car, vote for my candidate, scream, laugh, cry).</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.law.harvard.edu/hroberts/2008/03/28/two-spectacles/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by-sa/3.0/</creativeCommons:license>
	</item>
		<item>
		<title>Google Privacy Videos</title>
		<link>http://blogs.law.harvard.edu/hroberts/2008/03/06/google-watching-personal-data-collection/</link>
		<comments>http://blogs.law.harvard.edu/hroberts/2008/03/06/google-watching-personal-data-collection/#comments</comments>
		<pubDate>Thu, 06 Mar 2008 20:21:51 +0000</pubDate>
		<dc:creator>hal</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://blogs.law.harvard.edu/hroberts/2008/03/06/google-watching-personal-data-collec</guid>
		<description><![CDATA[I recently ran across a series of videos produced by Google to explain the data collection of its search engine.  Ms. Oyhe, an attractive, professional, and terribly reassuring support engineer explains what sorts of data Google collects and, implicitly, why users should not be overly concerned about the data collection:

To improve out search results [...]]]></description>
			<content:encoded><![CDATA[<p>I recently ran across <a href="http://www.youtube.com/googleprivacy">a series of videos</a> produced by Google to explain the data collection of its search engine.  Ms. Oyhe, an attractive, professional, and terribly reassuring support engineer <a href="http://www.youtube.com/watch?v=kLgJYBRzUXY&amp;feature=PlayList&amp;p=ECB20E29232BCBBA&amp;index=2">explains</a> what sorts of data Google collects and, implicitly, why users should not be overly concerned about the data collection:</p>
<blockquote><p>
To improve out search results as well as maintain security and prevent fraud we remember some basic information about searches.  Without this information, our search engine wouldn&#8217;t work as well as it does or be as secure. &#8230; We&#8217;re able to do that [replace 'carss' with 'cars'] because we&#8217;ve analyzed search queries in our logs and found that when people type in &#8216;carss&#8217; they really mean &#8216;cars&#8217;. &#8230; Only your provider can directly match you with your ip address. &#8230; What a cookie doesn&#8217;t tell google is personal stuff about you like where you live and what your phone number is. &#8230; In the same way that a store keeps a receipt of your purchases, google keeps a kind of receipt of your visit called a log. &#8230; As you can see, logs don&#8217;t contain any truly personal information about you.
</p></blockquote>
<p>All of this is true in a narrowly technical sense.  What&#8217;s missing is the recognition that the importance of data is determined by the larger world in which it lives &#8212; by the other data that it connects to.  So when Ms. Ohye asserts that a cookie doesn&#8217;t tell google &#8220;personal stuff about you like where you live&#8221; that&#8217;s only true in the sense that the cookie that your driver&#8217;s license number doesn&#8217;t tell the police where you live.  As with the driver&#8217;s license number, however, even though the cookie itself is just a random string of gibberish letters, it can indeed be used to lookup personal information &#8220;like where you live.&#8221;  </p>
<p>For example, the cookie connects, reasonably well, all searches performed by a single person.  Many, many people search for their own names at some point and for their own addresses at some point (if for no other reason than to see their houses in google maps).  The cookie connects those two searches to the same (otherwise anonymous) person, thus potentially identifying the name and address of the person behind the random gibberish of a particular cookie.  This method of identification is not perfect, but researchers have consistently shown the ability to crack the identity of individual users in these kinds of data collections with anonymous but individually unique identifiers, as learned by <a href="http://www.nytimes.com/2006/08/09/technology/09aol.html?_r=1&amp;ex=1312776000&amp;oref=slogin">AOL</a> and <a href="http://arxiv.org/abs/cs/0610105">Netflix</a>. </p>
<p>In fact, it&#8217;s likely that this collection of search terms, ip addresses, and cookies represents perhaps the largest, most sensitive single collection of data extant, on- or offline.  Google may or may not choose to do the relatively easy work necessary to translate its collection of search data into a database of personally identifiable data, but it does have the data and the ability to query personal data out of the collection at any time if it chooses (or is made to choose by a government, intruder, disgruntled worker, etc).</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.law.harvard.edu/hroberts/2008/03/06/google-watching-personal-data-collection/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	<creativeCommons:license>http://creativecommons.org/licenses/by-sa/3.0/</creativeCommons:license>
	</item>
	</channel>
</rss>
