<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Dave Underwood on Digital Marketing &#38; Online Strategy &#187; search engines</title>
	<atom:link href="http://daveunderwood.com/category/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://daveunderwood.com</link>
	<description>A New Zealand perspective on search engine marketing, web analytics, social media &#38; more</description>
	<lastBuildDate>Sun, 25 Jul 2010 11:09:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>An interesting answer to an interesting question</title>
		<link>http://daveunderwood.com/2010-05-20/an-interesting-answer-to-an-interesting-question/</link>
		<comments>http://daveunderwood.com/2010-05-20/an-interesting-answer-to-an-interesting-question/#comments</comments>
		<pubDate>Thu, 20 May 2010 05:29:01 +0000</pubDate>
		<dc:creator>Dave Underwood</dc:creator>
				<category><![CDATA[future trends]]></category>
		<category><![CDATA[online marketing]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Elliot Schrage]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[information]]></category>

		<guid isPermaLink="false">http://daveunderwood.com/?p=122</guid>
		<description><![CDATA[Q: &#8220;It used to be that I could limit what strangers saw about me to almost nothing. I could not show my profile picture, not allow them to “poke” or message me, certainly not allow them to view my profile page. Now, even my interests have to be public information. Why can’t I control my [...]]]></description>
			<content:encoded><![CDATA[<p></p><p><strong>Q:</strong> <a href="http://community.nytimes.com/comments/bits.blogs.nytimes.com/2010/05/06/ask-facebook-your-privacy-questions/?permid=15#comment15">&#8220;It used to be that I could limit what strangers saw about me to almost nothing. I could not show my profile picture, not allow them to “poke” or message me, certainly not allow them to view my profile page. Now, even my interests have to be public information. Why can’t I control my own information anymore?&#8221;</a></p>
<p><strong>Answer from Elliot Schrage, vice president for public policy at Facebook</strong>: &#8220;Joining Facebook is a conscious choice by vast numbers of people who have stepped forward deliberately and intentionally to connect and share. We study user activity. We’ve found that a few fields of information need to be shared to facilitate the kind of experience people come to Facebook to have. That’s why we require the following fields to be public: name, profile photo (if people choose to have one), gender, connections (again, if people choose to make them), and user ID number. Facebook provides a less satisfying experience for people who choose not to post a photo or make connections with friends or interests. But, other than name and gender, nothing requires them to complete these fields or share information they do not want to share. If you’re not comfortable sharing, don’t.&#8221;</p>
<p>Link: <a href="http://bits.blogs.nytimes.com/2010/05/11/facebook-executive-answers-reader-questions/">Facebook Executive Answers Reader Questions</a> [nytimes.com]</p>
]]></content:encoded>
			<wfw:commentRss>http://daveunderwood.com/2010-05-20/an-interesting-answer-to-an-interesting-question/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google Image Search: Specifying an image&#8217;s license</title>
		<link>http://daveunderwood.com/2010-01-25/google-image-search-specifying-an-images-license/</link>
		<comments>http://daveunderwood.com/2010-01-25/google-image-search-specifying-an-images-license/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 08:07:31 +0000</pubDate>
		<dc:creator>Dave Underwood</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[google image search]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[microformats]]></category>
		<category><![CDATA[rdfa]]></category>

		<guid isPermaLink="false">http://daveunderwood.com/?p=103</guid>
		<description><![CDATA[Looks pretty simple. Source: YouTube Read more about RDFa.]]></description>
			<content:encoded><![CDATA[<p></p><p>Looks pretty simple. </p>
<p><object width="480" height="289"><param name="movie" value="http://www.youtube.com/v/quyhasVn2jw&amp;hl=en_GB&amp;fs=1&amp;rel=0"><param name="allowFullScreen" value="true"><param name="allowscriptaccess" value="always"><embed src="http://www.youtube.com/v/quyhasVn2jw&amp;hl=en_GB&amp;fs=1&amp;rel=0" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="480" height="289"></object><br />
Source: <a href="http://www.youtube.com/watch?v=quyhasVn2jw">YouTube</a></p>
<p><a href="http://en.wikipedia.org/wiki/RDFa">Read more about RDFa</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://daveunderwood.com/2010-01-25/google-image-search-specifying-an-images-license/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Google News Search &#8211; Article Ranking Factors</title>
		<link>http://daveunderwood.com/2009-09-05/google-news-search-article-ranking-factors/</link>
		<comments>http://daveunderwood.com/2009-09-05/google-news-search-article-ranking-factors/#comments</comments>
		<pubDate>Sat, 05 Sep 2009 04:49:50 +0000</pubDate>
		<dc:creator>Dave Underwood</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[news search]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://daveunderwood.com/?p=76</guid>
		<description><![CDATA[Source: YouTube]]></description>
			<content:encoded><![CDATA[<p></p><p><object width="480" height="289"><param name="movie" value="http://www.youtube.com/v/hg8xgoULIIE&#038;hl=en&#038;fs=1&#038;"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/hg8xgoULIIE&#038;hl=en&#038;fs=1&#038;" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="480" height="289"></embed></object><br />
Source: <a href="http://www.youtube.com/watch?v=hg8xgoULIIE">YouTube</a></p>
]]></content:encoded>
			<wfw:commentRss>http://daveunderwood.com/2009-09-05/google-news-search-article-ranking-factors/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wolfram Alpha</title>
		<link>http://daveunderwood.com/2009-05-16/wolfram-alpha/</link>
		<comments>http://daveunderwood.com/2009-05-16/wolfram-alpha/#comments</comments>
		<pubDate>Sat, 16 May 2009 23:31:41 +0000</pubDate>
		<dc:creator>Dave Underwood</dc:creator>
				<category><![CDATA[search engines]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://daveunderwood.com/?p=59</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p></p><p><img src="http://daveunderwood.com/images/2009-05-16/wolframalphaloadexceeded.jpg" alt="Sorry? Say sorry to his kids!" /></p>
]]></content:encoded>
			<wfw:commentRss>http://daveunderwood.com/2009-05-16/wolfram-alpha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The anatomy of Google</title>
		<link>http://daveunderwood.com/2007-04-09/the-anatomy-of-google/</link>
		<comments>http://daveunderwood.com/2007-04-09/the-anatomy-of-google/#comments</comments>
		<pubDate>Mon, 09 Apr 2007 22:25:48 +0000</pubDate>
		<dc:creator>Dave Underwood</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Larry Page]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[Sergey Brin]]></category>

		<guid isPermaLink="false">http://daveunderwood.com/?p=10</guid>
		<description><![CDATA[Only nine years late, via Speaking Freely, I am reading the paper &#8216;The Anatomy of a Large-Scale Hypertextual Web Search Engine&#8216; (a.k.a Google) by Sergey Brin and Larry Page. I liked this bit about the Google crawler interrupting an online game: It turns out that running a crawler which connects to more than half a [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Only nine years late, via <a href="http://speakingfreely.wordpress.com/google-patents-and-papers/" title="Speaking Freely">Speaking Freely</a>, I am reading the paper &#8216;<a href="http://citeseer.ist.psu.edu/cache/papers/cs/13017/http:zSzzSzwww-db.stanford.eduzSzpubzSzpaperszSzgoogle.pdf/brin98anatomy.pdf" title="The Anatomy of a Large-Scale Hypertextual Web Search Engine">The Anatomy of a Large-Scale Hypertextual Web Search Engine</a>&#8216; (a.k.a Google) by Sergey Brin and Larry Page.</p>
<p>I liked this bit about the Google crawler interrupting an online game:<br />
<blockquote>It turns out that running a crawler which connects to more than half a million servers, and generates tens of millions of log entries generates a fair amount of email and phone calls. Because of the vast number of people coming on line, there are always those who do not know what a crawler is, because this is the first one they have seen. Almost daily, we receive an email something like, &#8220;Wow, you looked at a lot of pages from my web site. How did you like it?&#8221; There are also some people who do not know about the robots exclusion protocol, and think their page should be protected from indexing by a statement like, &#8220;This page is copyrighted and should not be indexed&#8221;, which needless to say is difficult for web crawlers to understand. Also, because of the huge amount of data involved, unexpected things will happen. For example, our system tried to crawl an online game. This resulted in lots of garbage messages in the middle of their game! It turns out this was an easy problem to fix. But this problem had not come up until we had downloaded tens of millions of pages. Because of the immense variation in web pages and servers, it is virtually impossible to test a crawler without running it on large part of the Internet. Invariably, there are hundreds of obscure problems which may only occur on one page out of the whole web and cause the crawler to crash, or worse, cause unpredictable or incorrect behavior. Systems which access large parts of the Internet need to be designed to be very robust and carefully tested. Since large complex systems such as crawlers will invariably cause problems, there needs to be significant resources devoted to reading the email and solving these problems as they come up.</p></blockquote>
<p>Source: &#8216;<a href="http://citeseer.ist.psu.edu/cache/papers/cs/13017/http:zSzzSzwww-db.stanford.eduzSzpubzSzpaperszSzgoogle.pdf/brin98anatomy.pdf" title="The Anatomy of a Large-Scale Hypertextual Web Search Engine">The Anatomy of a Large-Scale Hypertextual Web Search Engine</a>&#8216; Brin/Page, p. 10</p>
<p>It is also interesting to note the beginnings of <a href="http://www.google.co.nz/books?hl=en" title="Google Book Search">Google Book Search</a> in the acknowledgements:<br />
<blockquote>The research described here was conducted as part of the Stanford Integrated Digital Library Project, supported by the National Science Foundation under Cooperative Agreement IRI-9411306. Funding for this cooperative agreement is also provided by DARPA and NASA, and by Interval Research, and the industrial partners of the Stanford Digital Libraries Project.</p></blockquote>
<p>Source: &#8216;<a href="http://citeseer.ist.psu.edu/cache/papers/cs/13017/http:zSzzSzwww-db.stanford.eduzSzpubzSzpaperszSzgoogle.pdf/brin98anatomy.pdf" title="The Anatomy of a Large-Scale Hypertextual Web Search Engine">The Anatomy of a Large-Scale Hypertextual Web Search Engine</a>&#8216; Brin/Page, p. 16</p>
<p>Note also their thoughts on the relationship of search engines and advertising:<br />
<blockquote>Currently, the predominant business model for commercial search engines is advertising. The goals of the advertising business model do not always correspond to providing quality search to users. For example, in our prototype search engine one of the top results for cellular phone is &#8220;The Effect of Cellular Phone Use Upon Driver Attention&#8221;, a study which explains in great detail the distractions and risk associated with conversing on a cell phone while driving. This search result came up first because of its high importance as judged by the PageRank algorithm, an approximation of citation importance on the web [Page, 98]. It is clear that a search engine which was taking money for showing cellular phone ads would have difficulty justifying the page that our system returned to its paying advertisers. For this type of reason and historical experience with other media [Bagdikian 83], we expect that advertising funded search engines will be inherently biased towards the advertisers and away from the needs of the consumers.</p></blockquote>
<p>Source: &#8216;<a href="http://citeseer.ist.psu.edu/cache/papers/cs/13017/http:zSzzSzwww-db.stanford.eduzSzpubzSzpaperszSzgoogle.pdf/brin98anatomy.pdf" title="The Anatomy of a Large-Scale Hypertextual Web Search Engine">The Anatomy of a Large-Scale Hypertextual Web Search Engine</a>&#8216; Brin/Page, p. 18</p>
]]></content:encoded>
			<wfw:commentRss>http://daveunderwood.com/2007-04-09/the-anatomy-of-google/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
