<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Politics of Systems &#187; search engines</title>
	<atom:link href="http://thepoliticsofsystems.net/category/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://thepoliticsofsystems.net</link>
	<description>Thoughts on Software, Power, and Digital Method</description>
	<lastBuildDate>Thu, 17 May 2012 08:10:54 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Google&#8217;s knowledge graph: here comes the ontology</title>
		<link>http://thepoliticsofsystems.net/2012/05/googles-knowledge-graph-here-comes-the-ontology/</link>
		<comments>http://thepoliticsofsystems.net/2012/05/googles-knowledge-graph-here-comes-the-ontology/#comments</comments>
		<pubDate>Thu, 17 May 2012 08:04:49 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[actor-network theory]]></category>
		<category><![CDATA[algorithms]]></category>
		<category><![CDATA[critique]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[economy]]></category>
		<category><![CDATA[epistemolgy]]></category>
		<category><![CDATA[ontologies]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=479</guid>
		<description><![CDATA[Yesterday, Google introduced a new feature, which represents a substantial extension to how their search engine presents information and marks a significant departure from some of the principles that have underpinned their conceptual and technological approach since 1998. The &#8220;knowledge graph&#8221; basically adds a layer to the search engine that is based on formal knowledge]]></description>
			<content:encoded><![CDATA[<p>Yesterday, Google <a href="http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html">introduced </a>a new feature, which represents a substantial extension to how their search engine presents information and marks a significant departure from some of the principles that have underpinned their conceptual and technological approach since 1998. The &#8220;knowledge graph&#8221; basically adds a layer to the search engine that is based on formal knowledge modelling rather than word statistics (relevance measures) and link analysis (authority measures). As the title of the post on Google&#8217;s search blog aptly points out, the new features work by searching &#8220;things not strings&#8221;, because what they call the knowledge graph is simply a &#8211; very large &#8211; <a href="http://en.wikipedia.org/wiki/Ontology_%28information_science%29">ontology</a>, a formal description of objects in the world. Unfortunately, the roll-out is progressive and I have not yet been able to access the new features, but the descriptions, pictures, and video paint a rather clear picture of what product manager Johanna Wright calls the move &#8220;from an information engine to a knowledge engine&#8221;. In terms of the <a href="http://en.wikipedia.org/wiki/DIKW">DIKW model</a> (Data-Information-Knowledge-Wisdom), the new feature proposes to move up a layer by adding a box of factual information on a recognized object (the examples Google uses are the Taj Mahal, Marie Curie, Matt Groening, etc.) next to the search results. From the presentation, we can gather that the 500 million objects already referenced will include a large variety of things, such as movies, events, organizations, ideas, and so on.</p>
<p><img class="alignleft" title="Knowledge Graph" src="http://4.bp.blogspot.com/-6CZW79UMwyg/T7PKsKaiyyI/AAAAAAAAJK0/yj5a8qKknQg/s2000/marie%2Bcurie.png" alt="" width="640" height="286" /></p>
<p>This is really a very significant extension to the current logice and although we&#8217;ll need more time to try things out and get a better understanding of what this actually means, there are a couple of things that we can already single out:</p>
<ul>
<li>On a feature level, the fact box brings Google closer to &#8220;knowledge engines&#8221; such as <a href="http://www.wolframalpha.com/">Wolfram Alpha</a> and as we learn from the explanatory video, this explicitly includes semantic or computational queries, such as &#8220;how many women won the Nobel Prize?&#8221; type of questions.</li>
<li>If we consider Wikipedia to be a similar &#8220;description layer&#8221;, the fact box can also be seen as a competitor to everybody&#8217;s favorite encyclopedia, which is a further step into the direction of bringing information directly to the surface of the results page instead of simply referring to a location. This means that users do not have to leave the Google garden to find a quick answer. It will be interesting to see whether this will actually show up in Wikipedia traffic stats.</li>
<li>The introduction of an ontology layer is a significant departure from the largely statistical and graph theoretical methods favored by Google in the past. While features based on knowledge modelling have proliferated around the margins (e.g. in Google Maps and Local Search), the company is now bringing them to the center stage. From what I understand, the selection of &#8220;facts&#8221; to display will be largely driven by user statistics but the facts themselves come from places like <a href="http://www.freebase.com/">Freebase</a>, which Google bought in 2010. While large scale ontologies were prohibitive in the past, a combination of the availability of crowd-sourced databases (Wikipedia, etc.), the open data movement, better knowledge extraction mechanisms, and simply the resources to hire people to do manual repairs has apparently made them a viable option for a company of Google&#8217;s size.</li>
<li>Competing with the dominant search engine has just become a lot harder (again). If users like the new feature, the threshold for market entry moves up because this is not a trivial technical gimmick that can be easily replicated.</li>
<li>The knowledge graph will most certainly spread out into many other services (it&#8217;s already implemented in the new Google Docs <a href="http://arstechnica.com/business/2012/05/google-docs-new-sidebar-makes-research-faster/">research bar</a>), further boosting the company&#8217;s economies of scale and enhancing cross-navigation between the different services.</li>
<li>If the fact box &#8211; and the features that may follow &#8211; becomes a pervasive and popular feature, Google&#8217;s participation in making information and knowledge accessible, in defining its shape, scope, and relevance, will be further extended. This is a reason to worry a bit more, not because the Google tools as such are a danger, but simply because of the levels of institutional and economic concentration the Internet has enabled. The company has become what Michel Callon calls an &#8220;obligatory passage point&#8221; in our relation to the Web and beyond; the knowledge graph has the potential to exacerbate the situation even further.</li>
</ul>
<p>This is a development that looks like another element in the war for dominance on the Web that is currently fought at a frenetic pace. Since the introduction of <a href="https://developers.facebook.com/docs/opengraph/">actions</a> into Facebook&#8217;s social graph, it has become clear that approaches based on ontologies and concept modelling will play an increasing role in this. In a world mediated by screens, the technological control of <em>meaning</em> &#8211; the one true metamedium &#8211; is the new battleground. I guess that this is not what Berners-Lee had in mind for the Semantic Web&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2012/05/googles-knowledge-graph-here-comes-the-ontology/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>1934: the network as hierarchy</title>
		<link>http://thepoliticsofsystems.net/2012/03/1934-the-network-as-a-hierarchy/</link>
		<comments>http://thepoliticsofsystems.net/2012/03/1934-the-network-as-a-hierarchy/#comments</comments>
		<pubDate>Mon, 19 Mar 2012 07:46:16 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[algorithms]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[network theory]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[social networks]]></category>
		<category><![CDATA[software studies]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=447</guid>
		<description><![CDATA[I am currently writing a paper to submit to the new and very exciting journal computational culture on the use of graph theory to produce &#8220;evaluative metrics&#8221; in contexts like Web search or social networking. One of my core arguments is going to be that the network as descriptive (mathematical) model has never stood in]]></description>
			<content:encoded><![CDATA[<div id="attachment_466" class="wp-caption alignleft" style="width: 221px"><a href="http://thepoliticsofsystems.net/wp-content/uploads/2012/03/moreno_1934_p152.png"><img class=" wp-image-466" title="moreno_1934_p152" src="http://thepoliticsofsystems.net/wp-content/uploads/2012/03/moreno_1934_p152-211x300.png" alt="" width="211" height="300" /></a><p class="wp-caption-text">one of Moreno&#39;s famous sociograms</p></div>
<p>I am currently writing a paper to submit to the new and very exciting journal <a href="http://computationalculture.net/">computational culture</a> on the use of graph theory to produce &#8220;evaluative metrics&#8221; in contexts like Web search or social networking. One of my core arguments is going to be that the network as descriptive (mathematical) model has never stood in opposition to the notion of hierarchy but should rather be seen as a conceptual tool that was used in different fields (e.g. sociometry, psychometry, citation analysis, etc.) over the 20th century to investigate structure and, in particular, to both investigate and establish hierarchy. This finally gave me an excuse to dive into Jacob L. Moreno&#8217;s opus magnum <em>Who Shall Survive? </em>from 1934, which not only founded sociometry but also laid the ground work for social network analysis. This is one of the strangest books I have ever read, not only because the <a href="http://www.scribd.com/doc/82912410/Who-Shall-Survive-J-L-Moreno-1978-879pgs-PSY">edition from 1978</a> reveals the author as a deeply Nietzschean character (&#8220;<em>Actually, I have written two bibles, an old testament and a new testament.</em>&#8220;), but also because the <em>sociogenic therapy</em> Moreno proposes as an approach to the &#8220;German-Jewish conflict&#8221; puts the whole text in a deeply saddening light. But these aspects only deepen the impression that this is a fascinating book, really one of its kind.</p>
<p>Interestingly, Moreno also discovered what we would now call &#8220;power-law dynamics in social networks&#8221;. One of the applications of his &#8220;sociometric test&#8221; &#8211; basically a &#8220;who do you like&#8221; type of questionnaire &#8211; in a small American town named Hudson came to the following result:</p>
<blockquote><p>After the first phase of the sociometric test was given the analysis of the choices revealed that among a population of 435 persons,23 204, or 46.5%, remained unchosen after the 1st choice; 139, or 30%, after the 2d choice; 87, or 20%, after the 3rd choice; 74, or 17%, after the 4th choice; and 66, or 15%, after the 5th choice. (Moreno 1934, p. 249)</p></blockquote>
<div id="attachment_470" class="wp-caption alignleft" style="width: 221px"><a href="http://thepoliticsofsystems.net/wp-content/uploads/2012/03/Screen-Shot-2012-03-20-at-09.54.57-.png"><img class=" wp-image-470 " title="moreno_1934_powerlaw" src="http://thepoliticsofsystems.net/wp-content/uploads/2012/03/Screen-Shot-2012-03-20-at-09.54.57--266x300.png" alt="" width="211" /></a><p class="wp-caption-text">Moreno&#39;s comparison of distributions</p></div>
<p>This means that 15% of the population was not mentioned when the interviewees were asked which five people in the community they liked best. While this does not make for a particularly skewed distribution, Moreno transposes the result on the population of New York city and adds a quite tantalizing interpretation:</p>
<blockquote>
<p style="text-align: left;">There is no question but that this phenomenon repeats itself throughout the nation, however widely the number of unchosen may vary from 1st to 5th or more choices due to the incalculable influence of sexual, racial, and other psychological currents. For New York, with a population of 7,000,000, the above percentages would be after the 1st choice, 3,200,000 individuals unchosen; after the 2nd choice, 2,100,000 unchosen; after the 3rd choice, 1,400,000 unchosen; after the 4th choice, 1,200,000 unchosen; and after the 5th choice, 1,050,000 unchosen. These calculations suggest that mankind is divided not only into races and nations, religions and states, but into socionomic divisions. There is produced a socionomic hierarchy due to the differences in attraction of particular individuals and groups for other particular individuals and groups. (Moreno 1934, p. 250f)</p>
</blockquote>
<p>By looking into the history of the field, I hope to show that the observation of uneven distributions of connectivity in real-world networks, e.g. the work by <a href="http://press.princeton.edu/titles/8781.html">Hindman</a> and others concerning the Web, are certainly not a discovery of the &#8220;<a href="http://research.yahoo.com/files/w_ARS.pdf">new science of networks</a>&#8221; of recent years but a virtual constant in mathematical approaches to networks: whenever somebody starts counting, the result is an ordered list, normally with a considerable difference in value between the first and the last element. When it comes to applications of sociometry to sociology or anthropology, the question of leadership, status, influence, etc. is permanently in the forefront, especially from the 1950s onward when matrix algebra starts to allow for quick calculations of different forms of centrality. Contrary to popular myth, when Page and Brin came up with <em>PageRank</em>, they had a very wide variety of inspirational sources to draw from. Networks and ranking had been an old couple for quite a while already.</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2012/03/1934-the-network-as-a-hierarchy/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>a 1953 recursive algorithm for computing status</title>
		<link>http://thepoliticsofsystems.net/2012/03/a-1953-recursive-algorithm-for-computing-status/</link>
		<comments>http://thepoliticsofsystems.net/2012/03/a-1953-recursive-algorithm-for-computing-status/#comments</comments>
		<pubDate>Mon, 12 Mar 2012 09:28:50 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[algorithms]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[software studies]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=439</guid>
		<description><![CDATA[In 1953, Leo Katz, psychologist of the measuring kind, wrote the following: The purpose of this paper is to suggest a new method of computing status, taking into account not only the number of direct &#8220;votes&#8221; received by each individual but, also, the status of each individual who chooses the first, the status of each]]></description>
			<content:encoded><![CDATA[<p>In 1953, Leo Katz, psychologist of the measuring kind, wrote the following:</p>
<blockquote><p>The purpose of this paper is to suggest a new method of computing status, taking into account not only the number of direct &#8220;votes&#8221; received by each individual but, also, the status of each individual who chooses the first, the status of each who chooses these in turn, etc. Thus, the proposed new index allows for <em>who</em> chooses as well as how many choose.</p></blockquote>
<p>The <a href="http://ideas.repec.org/a/spr/psycho/v18y1953i1p39-43.html">paper</a> this is taken from is one of the references in Larry Page&#8217;s <a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;Sect2=HITOFF&amp;u=%2Fnetahtml%2FPTO%2Fsearch-adv.htm&amp;r=1&amp;p=1&amp;f=G&amp;l=50&amp;d=PTXT&amp;S1=6,285,999.PN.&amp;OS=pn/6,285,999&amp;RS=PN/6,285,999">PageRank patent</a>&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2012/03/a-1953-recursive-algorithm-for-computing-status/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>what a judge thinks about google instant</title>
		<link>http://thepoliticsofsystems.net/2012/01/what-a-judge-thinks-about-google-instant/</link>
		<comments>http://thepoliticsofsystems.net/2012/01/what-a-judge-thinks-about-google-instant/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 13:28:07 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[algorithms]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=424</guid>
		<description><![CDATA[In the middle of December, a French appeals court published its verdict in a case concerning Google&#8217;s instant/autocomplete/suggest feature and the company was fined $65K. After the holidays, a couple of publications (e.g. searchengineland and Ars Technica) picked up the story and as in every case where French legislation diverts from US sensibilities the comment]]></description>
			<content:encoded><![CDATA[<p>In the middle of December, a French appeals court published its verdict in a case concerning Google&#8217;s instant/autocomplete/suggest feature and the company was fined $65K. After the holidays, a couple of publications (e.g. <a href="http://searchengineland.com/google-instant-costs-google-65000-in-france-106136">searchengineland</a> and <a href="http://arstechnica.com/tech-policy/news/2012/01/french-court-frowns-on-google-autocomplete-issues-65000-fine.ars">Ars Technica</a>) picked up the story and as in every case where French legislation diverts from US sensibilities the comment sections erupted with chauvinistic righteousness. What was the case about? Here is the full text of a notice by the Courthouse News Service:</p>
<blockquote><p>A French court fined Google $65,000 because the search engine&#8217;s autocomplete function prompts the French word for crook when users type the name of a certain company. Lyonnaise de Garantie, an insurance company, said staffers at Google should have monitored linked words better. Google had argued that it was not liable since the word, added under Google Suggest, was the result of an automatic algorithm and did not come from human thought. A Paris court ruled against Google, however, pointing out that the search engine ignored requests to remove the offending word &#8211; &#8220;escroc,&#8221; which means crook in French. In addition to the fine, Google must also remove the term from searches associated with Lyonnaise de Garantie.</p></blockquote>
<p>Unfortunately, this is basically all the information that circulated in English. But it&#8217;s always interesting to have a closer look at how lawmakers and judges look at information-systems-as-media question and so I went to have a look at the <a href="http://www.legalis.net/spip.php?page=jurisprudence-decision&amp;id_article=3303">text</a> of the actual verdict.<br />
There are a couple of points that are really quite remarkable here, and make the case much more interesting than it appears. Google&#8217;s arguments basically made three arguments:</p>
<ul>
<li>We are an American company and therefore&#8230; (I will not go into the questions that are not specific to Web search.)</li>
<li>The suggest feature is purely &#8220;informatic&#8221; and does not represent an &#8220;intellectual act&#8221;, a &#8220;value judgement&#8221; or an &#8220;opinion&#8221;. (This is the common argument, nothing new here.)</li>
<li>The &#8220;average internet user&#8221; knows that search suggestions are not <em>content</em>. In fact, users do not make any interpretations independently from search results. There is &#8220;no confusion in their minds&#8221; about the difference. (Finally, things are getting more interesting!)</li>
</ul>
<p>The judge however did not see things this way and made a series of quite remarkable observations:</p>
<ul>
<li>If the process is fully automated, how does Google remove &#8220;offensive&#8221; and &#8220;vulgar&#8221; terms from the suggestion lists? Obviously, intervention is possible and regularly applied, even for content &#8211; such as vulgarity &#8211; that is not illegal. So why not in this case?</li>
<li>While it would certainly be difficult to find all cases where individuals or companies are put in a bad light in a suggest list, Google was perfectly aware in this case, because the company in question had contacted them repeatedly.</li>
<li>While the procedure may be automatic, the phrase “Lyonnaise de Garantie escroc” is a human judgement and its circulation on the net is made possible by the machinery. Using algorithms is just another way of &#8220;organizing and presenting human thought&#8221;.</li>
<li>The phase appears already at the moment when one types “Lyonnaise de G” and this &#8220;suddenness&#8221; has the effect of &#8220;imposing the expression&#8221; on the user.</li>
<li>When looking at the results for the query, they do not explain why the term &#8220;escroc&#8221; is attributed to the company, i.e. the content does not signal any facts that would justify the term.</li>
</ul>
<p>Now these are some interesting arguments and while I am not qualified to comment on the validity of the judgement, there is a stark contrast between Google&#8217;s and the judge&#8217;s framing of the question. While Google makes an ontological argument (&#8220;an algorithm cannot have an opinion&#8221;), the judge pushes that argument into the background and bases the verdict on the question &#8220;can Google be bothered to remove a text that is injurious?&#8221;. The answer is &#8220;yes&#8221;, because a) intervention is obviously possible and b) they were made aware by the plaintiff. It also treats the &#8220;instant&#8221; feature as living up to its former name: &#8220;suggest&#8221;.</p>
<p><a href="http://thepoliticsofsystems.net/wp-content/uploads/2012/01/Screen-Shot-2012-01-10-at-14.26.45-.png"><img class="alignleft size-thumbnail wp-image-427" title="Screen Shot 2012-01-10 at 14.26.45" src="http://thepoliticsofsystems.net/wp-content/uploads/2012/01/Screen-Shot-2012-01-10-at-14.26.45--150x150.png" alt="" width="150" height="150" /></a>While regulation of &#8220;indecency&#8221; is much less pronounced in Europe than in the US, libel laws are of course much stricter, but I do not want to comment on that. What I find thoroughly fascinating about this case is that legal professionals are forced to form opinions about questions as ambiguous as algorithmic agency. By choosing to judge outcomes rather than methodology, the judge in this case (and the judges that treated it in the first instance) have created a precedent that may affect the use of statistical and other techniques that often produce unforeseeable effects. On the other hand side, the verdict is largely based on the fact the the plaintiffs requests for removal were ignored. Google is by no means forced to police suggest features in the future.</p>
<p>Automated information systems order information very differently from manually compiled catalogs or category systems. They produce different forms of &#8220;intelligence&#8221; and it is difficult to think about their directness in terms of opinion or partisanship. What just happened in this case however is that, at least on a legal level, the gap between the two elements was closed a little bit. The judge did not require Google to put the algorithm on a leash but told them to pick up its mess.</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2012/01/what-a-judge-thinks-about-google-instant/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>strange findings on the AdWords front</title>
		<link>http://thepoliticsofsystems.net/2011/10/strange-findings-on-the-adwords-front/</link>
		<comments>http://thepoliticsofsystems.net/2011/10/strange-findings-on-the-adwords-front/#comments</comments>
		<pubDate>Mon, 24 Oct 2011 13:12:45 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[economy]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=412</guid>
		<description><![CDATA[Google is an interesting company and not only because it has superbig data centers and mighty algorithms. It is also interesting because it controls a pretty big pie of the Internet advertisement market and uses a fascinating auction system to sell ad space. Need traffic? You can either SEO your site to the max or]]></description>
			<content:encoded><![CDATA[<p>Google is an interesting company and not only because it has superbig data centers and mighty algorithms. It is also interesting because it controls a pretty big pie of the Internet advertisement market and uses a fascinating auction system to sell ad space. Need traffic? You can either SEO your site to the max or just buy some advertisement. Google apparently has good prices and good results. At least Microsoft seems to think that:  <a href="http://thepoliticsofsystems.net/wp-content/uploads/2011/10/search_engines_IPUS.png"><img class="aligncenter size-full wp-image-413" title="search_engines_IPUS" src="http://thepoliticsofsystems.net/wp-content/uploads/2011/10/search_engines_IPUS.png" alt="" width="650" /></a>Yes, this is an ad for Bing in the first line. What is even more wondrous though is why Google would advertise (right column, third from the top) on the query &#8220;search engines&#8221; on their own site. AdWords must be very effective indeed.</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2011/10/strange-findings-on-the-adwords-front/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>how to establish search result manipulation?</title>
		<link>http://thepoliticsofsystems.net/2011/09/how-to-establish-search-result-manipulation/</link>
		<comments>http://thepoliticsofsystems.net/2011/09/how-to-establish-search-result-manipulation/#comments</comments>
		<pubDate>Thu, 22 Sep 2011 08:34:16 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[critique]]></category>
		<category><![CDATA[economy]]></category>
		<category><![CDATA[method]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=395</guid>
		<description><![CDATA[Over the last couple of weeks, things have heated up considerably for Google &#8211; on the mobile side with the start of a patent war, but also in the search area, the core of the company&#8217;s business. Led by Senator Mike Lee (a Utah Republican), the US Senate&#8217;s Antitrust Subcommittee has started to probe into]]></description>
			<content:encoded><![CDATA[<p>Over the last couple of weeks, things have heated up considerably for Google &#8211; on the mobile side with the start of a patent war, but also in the search area, the core of the company&#8217;s business. Led by Senator Mike Lee (a Utah Republican), the US Senate&#8217;s Antitrust Subcommittee has <a href="http://www.politico.com/news/stories/0311/51152.html">started to probe</a> into certain aspects of Google&#8217;s ranking mechanisms and potential cases of abuse and manipulation.</p>
<p>In a hearing on Wednesday, Lee confronted Eric Schmidt with accusations of tampering with results and the evidence the Senator presented was in fact very interesting because it raises the question of how to <em>show</em> or even <em>prove</em> that a highly complex algorithmic procedure &#8220;has been tampered with&#8221;. As you can see in <a href="http://www.cbsnews.com/video/watch/?id=7381863nn">this video</a>, a scatter-plot from an &#8220;independent study&#8221; that compares the search ranking for three price comparison sites (Nextag, Pricegrabber, and Shopper) with Google Price Search using 650 shopping related queries. What we can see on the graph is that while there is considerable variation in ranking for the competitors (a site shows up first for one query and way down for another), Google&#8217;s site seems to consistently stick to place three. Lee makes this astounding difference the core of his argument and directly asks Schmidt: &#8220;These results are in fact the result of the same algorithm as the rankings for the other comparison sites?&#8221; The answer is interesting in itself as Schmidt argues that Google&#8217;s service is not a product comparison site but a &#8220;product site&#8221; and that the study basically compares apples to oranges (&#8220;they are different animals&#8221;). Lee then homes in on the &#8220;uncanny&#8221; statistical regularity and says &#8220;I don&#8217;t know whether you call this a separate algorithm or whether you&#8217;re reverse engineered a single algorithm, but either way, you&#8217;ve cooked it!&#8221; to which Schmidt replies &#8220;I can assure you that we haven&#8217;t cooked anything.&#8221;</p>
<p>According to this <a href="http://latimesblogs.latimes.com/money_co/2011/09/google-antitrust-eric-schmidt-senate-hearing-herb-kohl-mike-lee.html">LA Times article</a>, Schmidt&#8217;s testimony did not satisfy the senators and there&#8217;s open talk about bias and conflict of interest. I would like to add to add three things here:</p>
<p>1) The debate shows a real mismatch between 20th century concepts of both <em>bias</em> and <em>technology</em> and the 21st century challenge to both of these question that comes in the form of Google. For the senator, bias is something very blatant and obvious, a malicious individual going to the server room at night, tempering with the machinery, transforming the pure technological objectivity into travesty by inserting a line of code that puts Google to third place most of the time. The problem with this view is of course that it makes a clear and strong distinction between a &#8220;biased&#8221; and an &#8220;unbiased&#8221; algorithm and clearly misses the point that every ranking procedure implies a bias. If Schmidt says &#8220;We haven&#8217;t cooked anything!&#8221;, who has written the algorithm? If it comes to an audit of Google&#8217;s code, I am certain that no &#8220;smoking gun&#8221; in the form of a primitive and obvious &#8220;manipulation&#8221; will be found. If Google wants to favor its own services, there are much more subtle and efficient ways to do so &#8211; the company does have the best SEO team one could possibly imagine after all. There is simply no need to &#8220;cook&#8221; anything if you are the one who specifies the features of the algorithm.</p>
<p>2) The research method applied in the mentioned study however is really quite interesting and I am curious to see how far the Senate committee will be able to take the argument. The statistical regularity shown is certainly astounding and if the hearings attain a deeper level of technological expertise, Google may be forced to detail a significant portion of its ranking procedures to show how something like this can happen. It would, of course, be extremely simple to break the pattern by introducing some random element that does not affect the average rank but adds variation. That&#8217;s also the reason why I think that Lee&#8217;s argument will ultimately fizzle.</p>
<p>3) The core of the problem, I would argue, is not so much the question of manipulation but the fact that by branching into more and more commercial areas, Google finds itself in a market configuration where conflicts of interest are popping up everywhere they turn. As both a search business and an actor on many of the markets that are, at least in part, ordered by the visibility layering in search results, there is a fundamental and structural problem that cannot be solved by any kind of imagined technical neutrality. Even if there is no &#8220;in house SEO&#8221; going on, the mere fact that Google search prominently links to other company services could already be seen as problematic. In a sense, Senator Lee&#8217;s argument actually creates a potentially useful &#8220;way out&#8221;: if there is no evil line of code written in the dark of night, no &#8220;smoking gun&#8221;, then everything is fine. The systematic conflict of interest persists however, and I do not believe that more subtle forms of bias towards Google services could be proven or even be seriously debated in a court of law. This level of technicality, I would argue, is no longer (fully) in reach for this kind of causal demonstration. Not so much because of the complexity of the algorithms, but rather because the &#8220;state&#8221; of the machine includes the full structure of the dataset it is working on, which means the full index in this case. To understand what Google&#8217;s algorithms actually do, looking at these algorithms <em>without</em> the data is no longer enough. And the data is big. Very big.</p>
<p>As you can see, I am quite pessimistic about the possibility to bring the kind of argumentation presented by Senator Lee to a real conclusion. If the case against Microsoft is an indicator, I would argue that this pessimism is warranted.</p>
<p>I do believe that we need to concentrate much more on the principal conflicts of interest rather than actual cases of abuse that may be simply too difficult to prove. The fundamental question is really how far a search company that controls such a large portion of the global market should be allowed to be active in other markets. And, really, should a single company control the search market in the first place? Limiting <em>the very potential for abuse</em> is, in my view, the road that legislators and regulators should take, rather than picking a fight over technological issues that they simply cannot win in the long run.</p>
<p>EDIT: Google has compiled its own <a href="http://googlecompetition.blogspot.com/2011/09/guide-to-senate-judiciary-hearing.html">Guide to the Hearing</a>. Interesting.</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2011/09/how-to-establish-search-result-manipulation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>simple causality, ranking, and segmentation</title>
		<link>http://thepoliticsofsystems.net/2011/07/simple-causality-ranking-and-segmentation/</link>
		<comments>http://thepoliticsofsystems.net/2011/07/simple-causality-ranking-and-segmentation/#comments</comments>
		<pubDate>Thu, 28 Jul 2011 11:05:54 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[epistemolgy]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[technological determinism]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=373</guid>
		<description><![CDATA[While scholars often underline their commitment to non-deterministic conceptions of &#8220;effects&#8221;, models of causality in the human and social sciences can still be a bit simplistic sometimes. But a more subtle approach to causality would have to concede that, while most often cumulative and contradictory, lines of causation can sometimes be quite straightforward. Just consider]]></description>
			<content:encoded><![CDATA[<p>While scholars often underline their commitment to non-deterministic conceptions of &#8220;effects&#8221;, models of causality in the human and social sciences can still be a bit simplistic sometimes. But a more subtle approach to causality would have to concede that, while most often cumulative and contradictory, lines of causation can sometimes be quite straightforward. Just consider this example from<em> Commensuration as a Social Process</em>, a great <a href="steinhardt.nyu.edu/scmsAdmin/uploads/000/341/Commensuration.pdf">text </a>from 1998 by Espeland and Stevens:</p>
<blockquote><p>Faculty at a well-regarded liberal arts college recently received unexpected, generous raises. Some, concerned over the disparity between their comfortable salaries and those of the college&#8217;s arguably underpaid staff, offered to share their raises with staff members. Their offers were rejected by administrators, who explained that their raises were &#8216;not about them.&#8217; Faculty salaries are one criterion magazines use to rank colleges. (p.313)</p></blockquote>
<p>This is a rather direct effect of ranking techniques on something very tangible, namely salary. But the relative straightforwardness of the example also highlights a bifurcation of effects: faculty gets paid more, staff less. The specific construction of the ranking mechanism in question therefore produces social segmentation. Or does it simply reinforce the existing segmentation between faculty and staff that lead college evaluators to construct the indicators the way they did in the first place? Well, there goes the simplicity&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2011/07/simple-causality-ranking-and-segmentation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>somebody slipped me an ontology into my algorithm</title>
		<link>http://thepoliticsofsystems.net/2011/07/somebody-slipped-me-an-ontology-into-my-algorithm/</link>
		<comments>http://thepoliticsofsystems.net/2011/07/somebody-slipped-me-an-ontology-into-my-algorithm/#comments</comments>
		<pubDate>Mon, 11 Jul 2011 07:55:18 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[algorithms]]></category>
		<category><![CDATA[ontologies]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=348</guid>
		<description><![CDATA[In the beginning, it was all about the algorithm. PageRank and its &#8220;no humans involved&#8221; mantra dominated Google since its inception. In recent years however, Google has started to expand the role of &#8220;conceptual&#8221; knowledge in different areas of its services. The main search bar and its capacity to do all kinds of little tricks]]></description>
			<content:encoded><![CDATA[<p>In the beginning, it was all about the algorithm. PageRank and its &#8220;no humans involved&#8221; mantra dominated Google since its inception. In recent years however, Google has started to expand the role of &#8220;conceptual&#8221; knowledge in different areas of its services. The main search bar and its capacity to do all kinds of little tricks is a good example, but I was really quite astounded how seamless concept integration has become on my last trip to Google Translate:</p>
<p><a href="http://thepoliticsofsystems.net/wp-content/uploads/2011/07/smart-google-translate1.png"><img class="alignnone size-full wp-image-353" title="smart google translate" src="http://thepoliticsofsystems.net/wp-content/uploads/2011/07/smart-google-translate1.png" alt="" width="557" height="389" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2011/07/somebody-slipped-me-an-ontology-into-my-algorithm/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Google downranking nasty merchants, but how?</title>
		<link>http://thepoliticsofsystems.net/2010/12/google-downranking-nasty-merchants-but-how/</link>
		<comments>http://thepoliticsofsystems.net/2010/12/google-downranking-nasty-merchants-but-how/#comments</comments>
		<pubDate>Thu, 02 Dec 2010 08:58:00 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[algorithms]]></category>
		<category><![CDATA[search engines]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=262</guid>
		<description><![CDATA[The Official Google Blog has recently written about changes to the ranking procedure that were introduced after a NYT article wrote about an online retailer that had apparently found out that being nasty to your customers would help getting good search rankings because all of the complaints and bad user reviews would get you links]]></description>
			<content:encoded><![CDATA[<p>The Official Google Blog has recently <a href="http://googleblog.blogspot.com/2010/12/being-bad-to-your-customers-is-bad-for.html">written about</a> changes to the ranking procedure that were introduced after a <a href="http://www.nytimes.com/2010/11/28/business/28borker.html?_r=2&amp;pagewanted=all">NYT article</a> wrote about an online retailer that had apparently found out that being nasty to your customers would help getting good search rankings because all of the complaints and bad user reviews would get you links and boost PageRank. While Google denies that this logic would work, they have added a ranking layer to their search results that specifically targets online merchants. The interesting thing about the blog post is that the author details several things that the company could have done but didn&#8217;t do while actually revealing very little about what the &#8220;algorithmic solution&#8221; they implemented actually consists of. From the post:</p>
<blockquote><p>Instead, in the last few days we developed an algorithmic solution which detects the merchant from the Times article along with hundreds of other merchants that, in our opinion, provide an extremely poor user experience. The algorithm we incorporated into our search rankings represents an initial solution to this issue, and Google users are now getting a better experience as a result.</p></blockquote>
<p>While I do not believe that transparency is the prime solution to the gatekeeper issues surrounding search, this paragraph really is strikingly vague. Has Google compiled a list of merchants that are systematically downranked? How is this list compiled? What does &#8220;in our opinion&#8221; mean? Is this &#8220;opinion&#8221; expressed in the form of an algorithmic procedure (one could imagine using the <a href="http://microformats.org/wiki/hreview">hReview microformat</a> to collect reviews on merchants)?</p>
<p>We&#8217;ll probably not get any answers to these questions but the case really shows how murky the whole ranking thing really has become: in an always growing online world, search visibility has extremely important financial ramifications (despite the social media hype) and I believe that companies like Google will increasingly rely on human judgment as a complement to algorithmic procedures (which are just another form of human judgment BTW). This will certainly lead to more legal activity around ranking in the future because courts still understand human meddling a lot better than software design&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2010/12/google-downranking-nasty-merchants-but-how/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bing + Facebook = I like echo chamber?</title>
		<link>http://thepoliticsofsystems.net/2010/10/bing-facebook-i-like-echo-chamber/</link>
		<comments>http://thepoliticsofsystems.net/2010/10/bing-facebook-i-like-echo-chamber/#comments</comments>
		<pubDate>Thu, 14 Oct 2010 07:14:37 +0000</pubDate>
		<dc:creator>Bernhard</dc:creator>
				<category><![CDATA[critique]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[social networks]]></category>

		<guid isPermaLink="false">http://thepoliticsofsystems.net/?p=224</guid>
		<description><![CDATA[Yesterday, Microsoft announced another step in their &#8220;long-term partnership&#8221; with Facebook. The two companies have had close ties since Microsoft invested a hefty sum in Facebook in 2007 and the former has managed advertisement on the latter&#8217;s site for quite a while. The &#8220;next step&#8221; will basically add a &#8220;social layer&#8221; to Bing search results]]></description>
			<content:encoded><![CDATA[<p>Yesterday, Microsoft <a href="http://www.bing.com/community/blogs/search/archive/2010/10/13/new-signals-in-search-the-bing-social-layer.aspx">announced</a> another step in their &#8220;long-term partnership&#8221; with Facebook. The two companies have had close ties since Microsoft invested a hefty sum in Facebook in 2007 and the former has managed advertisement on the latter&#8217;s site for quite a while. The &#8220;next step&#8221; will basically add a &#8220;social layer&#8221; to Bing search results (go to <a href="http://arstechnica.com/microsoft/news/2010/10/bing-adds-more-facebook-what-your-friends-like-people-search.ars">Ars Technica</a> for a writeup or <a href="http://kara.allthingsd.com/20101013/liveblogging-the-bing-facebook-bromance/?mod=googlenews">All Things Digital</a> for a liveblog of the PR event) and this is actually a pretty big thing. Google has certainly taken contextual information into account when deciding which results to show and how to rank them: physical location, search history, and gmail contacts have been part of that process for a while, but the effects have been rather subtle.</p>
<p>Bing&#8217;s new features basically use the same technical layer as the Facebook boxes that popped up all over the Web about half a year ago (most modern browsers have plug-ins that allow you to block those by the way). If Bing detects the Facebook cookie while you&#8217;re on their site and adds a couple of features that allow you to <a href="http://www.discoverbing.com/facebook/?fbid=9a9f5q93WgZ&amp;wom=false#step-2">interact</a> with &#8220;friends&#8221; more easily. There are some basic convenience features but it is the &#8220;liked results&#8221; that are the most remarkable: results will use your contact&#8217;s &#8220;likes&#8221; to rank results. While we will have to wait to see how these features will pan out, social search may look something like this:</p>
<p><img class="alignnone" src="http://www.discoverbing.com/img/screenshot-social_3.2.jpg" alt="Bing social search interface" width="620" height="170" /></p>
<p>In this example, the first result is the announcement of a news article on the release of the DVD version of Iron Man 2 and this would be hardly a top-ranked result without the social layer. If Bing continues to make inroads on Google, the &#8220;like&#8221; button may take on additional importance for driving traffic and marketeers will most certainly device new ways to get people to &#8220;like&#8221; stuff &#8211; e.g. &#8220;press the button and win a free t-shirt&#8221;.</p>
<p>Cas Sunstein&#8217;s arguments on the dangers of <a href="http://www.onthemedia.org/yore/transcripts/transcripts_020604_echo.html">echo chambers</a> &#8211; &#8220;incestuous amplification&#8221; in social groups &#8211; will certainly be taken up again, and perhaps rightfully so: while the Internet remains a beautifully heterogeneous mess, the algorithmically sustained support for the logic of homophily (&#8220;birds of a feather&#8230;&#8221;) that can be observed in more and more places on the Web merits critical examination. While Diana Mutz&#8217;s <a href="http://www.politicalreviewnet.com/polrev/reviews/JOPO/R_0022_3816_619_1007672.asp">work</a> makes the inconvenient argument that &#8220;hearing the other side&#8221; of political debate may actually lead to less political engagement, our representative systems of democratic governance require a certain willingness to accept different political viewpoints (that always float on less clearly delineated cultural sensibilities) as sincere and legitimate. Also, adding a &#8220;friend&#8221; dimension to yet another dimension of the Web could be seen as a further reduction of the &#8220;publicness&#8221; that, <a href="http://verbalperambulation.files.wordpress.com/2007/02/schudson-michael-why-conversation-is-not-the-soul-of-democracy.pdf">according to</a> Michael Schudson, caracterizes working democratic discourse. Being able to dissociate ourselves from our private entanglements and take into account the interests of those who do not ressemble us is perhaps the central prerequisite to successfully navigating a smaller planet.</p>
<p>Bing&#8217;s new features are certainly not the end of life as we know it but I believe that the privacy question &#8211; as important as it is &#8211; is covering a series of more difficult problems that sit at the heart of political life in the age of the Internet&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://thepoliticsofsystems.net/2010/10/bing-facebook-i-like-echo-chamber/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

