<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Yellow SEO - Search Engine Optimization Blog &#187; search engines</title>
	<atom:link href="http://www.yellowseo.com/blog/tag/search-engines/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.yellowseo.com/blog</link>
	<description>Rank Better.  Drive traffic.  Get Results.</description>
	<lastBuildDate>Fri, 23 Apr 2010 13:49:11 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Wolfram Alpha Launch: The Dawn of a New Era for the Web?</title>
		<link>http://www.yellowseo.com/blog/seo-news/wolfram-alpha-launch/</link>
		<comments>http://www.yellowseo.com/blog/seo-news/wolfram-alpha-launch/#comments</comments>
		<pubDate>Sat, 16 May 2009 19:02:11 +0000</pubDate>
		<dc:creator>Amber</dc:creator>
				<category><![CDATA[SEO News]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[search results]]></category>
		<category><![CDATA[Wolfram Alpha]]></category>

		<guid isPermaLink="false">http://www.yellowseo.com/blog/?p=223</guid>
		<description><![CDATA[Wolfram Alpha is an intriguing new search engine which is based on building a large set of facts into a system and making it computable.  On May 15th at 7 pm, WolframAlpha became public.  Initial queries like &#8220;What is the meaning of life?&#8221; Answer: 42 were quite hilarious. (fyi, references the age-old Hitchhikers [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.yellowseo.com/blog/seo-news/wolfram-alpha-launch/attachment/wolfram2/" title="wolfram2"><img src="http://www.yellowseo.com/blog/wp-content/uploads/2009/05/wolfram2.jpg" alt="wolfram2" width="276" height="254" class="attachment wp-att-301 alignleft" /></a>Wolfram Alpha is an intriguing new search engine which is based on building a large set of facts into a system and making it computable.  On May 15th at 7 pm, WolframAlpha became public.  Initial queries like &#8220;What is the meaning of life?&#8221; Answer: 42 were quite hilarious. (fyi, references the age-old Hitchhikers guide to the galaxy)</p>
<p>After a few more queries, it became clear that there are a lot of facts which the system does not yet know about.  However, we believe that it will quickly change.</p>
<p><span id="more-223"></span></p>
<p>Stephen Wolfram, the mastermind behind <em>Mathematica</em> and <em>A New Kind of Science</em> which proved to big a huge success and brought world recognition to the author, is now challenging the concept of search engine. Wolfram Alpha is targeted to put the rules of mathematical data and linguistics together to compile the worlds first computational knowledge engine. At the base of this quite ambitious project, is the belief in the power of mathematical knowledge in generating effective solutions to a wide range of problems.</p>
<p>For years search engines have been trying to improve the relevance of the search results. When Google first appeared, it <span> </span>became an online sensation by providing the closest match to all search queries. In relatively short time, Yahoo, MSN and others were left behind as Google established its reputation of the industry leader with the market share close to 70%.</p>
<p>Wolfram Alpha offers a refreshing approach to improving the search results. It eliminates the guesswork and the hassle of going through a variety of webpages to find the best answer to your question. Wolfram Alpha uses a special algorithm to pull the information from different sources and combine them into a complete logical answer.<br />
People tend to be righteously skeptical about the release of new search engines. We have seen the appearance and quick decay of many of them which failed to deliver their far-going promises. Is Wolfram Alpha one of the many destined to be buried under Google dominance or if incredibly successful, is it going to become the next “Googlekiller”?</p>
<p>We lean towards the opinion that Google and Wolfram Alpha are not necessarily fierce rivals since people will go to each of them for different reasons. Google is very effective when you need to find various information sources and go actual webpages, whereas Wolfram Alpha will be most useful when you look for simple factual answers to your questions. The latter can definitely divert some of Google’s traffic yet it won’t be sufficient to change the entire search engine hierarchy.</p>
<p>An interesting note is that fact oriented processing engines like WolframAlpha effectively eliminate the previous <a title="search engine optimization" href="http://www.yellowseo.com/">search engine optimization</a> rules.</p>
<p>Truth to be said, at this early stage we can only make up theories and now that we&#8217;ve had time to play with it, Google has nothing to worry about in the near future. History has proven multiple times that a true success is rarely about having a great idea but more so about its<span> </span>execution.</p>
<p>However, if Wolfram Alpha meets even a portion of its high expectations in the future, it could be a new intriguing web destination.  In any case, building a huge computational cluster, with billions of factoids, and mathematical logic seems like the perfect precursor for the future Skynet.</p>
<img src="http://www.yellowseo.com/blog/?ak_action=api_record_view&id=223&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.yellowseo.com/blog/seo-news/wolfram-alpha-launch/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Caution:Blog Scrapers</title>
		<link>http://www.yellowseo.com/blog/general-musings/stop-blog-scrapers/</link>
		<comments>http://www.yellowseo.com/blog/general-musings/stop-blog-scrapers/#comments</comments>
		<pubDate>Fri, 08 May 2009 13:14:12 +0000</pubDate>
		<dc:creator>Amber</dc:creator>
				<category><![CDATA[General Musings]]></category>
		<category><![CDATA[SEO Tips]]></category>
		<category><![CDATA[blog scraping]]></category>
		<category><![CDATA[blog scraping prevention]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[RSS feed]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://www.yellowseo.com/blog/?p=177</guid>
		<description><![CDATA[There has recently been a lot talk about the importance of blogging for SEO purposes. Many experts agree that a blog is a dynamic and interactive tool which allows you to effectively address vital issues and share relevant and useful information with readers both inside and outside the industry. But as blogging increases in popularity, [...]]]></description>
			<content:encoded><![CDATA[<p><a title="paws1" href="http://www.yellowseo.com/blog/general-musings/stop-blog-scrapers/attachment/paws1/"><img class="attachment wp-att-290 alignleft" src="http://www.yellowseo.com/blog/wp-content/uploads/2009/05/paws1.thumbnail.jpg" alt="paws1" width="276" height="254" /></a>There has recently been a lot talk about the importance of blogging for SEO purposes. Many experts agree that a <a title="blog" href="http://www.yellowseo.com/blog" target="_self">blog</a> is a dynamic and interactive tool which allows you to effectively address vital issues and share relevant and useful information with readers both inside and outside the industry. But as blogging increases in popularity, more websites are facing the alarming problem of content theft.<br />
My first encounter with this issue happened a while ego when I posted my first blog on <a title="Data Feed Optmization" href="http://www.yellowseo.com/blog/seo-tips/data-feed-optimization/" target="_self">Data Feed Optimization</a>. In a matter of a hours the word-by-word copy appeared on multiple sites. A few days later the original blog was completely knocked out of search results. What happened to me is not uncommon. Scrapers also known as content scrapers or feed scrapers are computer programs that go out onto the web and copy the content from other sites with unique content.<br />
<span id="more-177"></span><br />
They usually do it by adding your RSS to their site. Once they get a notification that you published something they automatically scrape the new post. They almost never link back leaving the poor low-ranked blog behind to suffer lower rankings.<br />
Another side of content theft is blunt plagiarism which I personally consider to be even more appalling. I was recently browsing the web looking for good blogs on the latest SEO trends and came across a great article by <a title="Stephan Spencer" href="http://www.marketingprofs.com/8/latest-seo-trends-tactics-spencer.asp" target="_blank">Stephan Spencer</a>. This article was duplicated at least 7 times on the same page for search results. Most bloggers from the first page did refer to the original author except for one. Some guy Chris Wood who claims to be a reputable online marketing consultant copied the entire blog word-after-word, pasted in into his own blog using site-specific design and formatting, and took the credit for it. Such practices occur every single day!<br />
One would think Google and other search engines are smart enough to detect cases of content theft but unfortunately, they are not. If your blog is scraped almost simultaneously without a backlink, is next to impossible for Google to decided who has the original copy and who is a copycat. This is where your page rank may come into play. Sadly, if your blog thief has a much higher page rank you may the one labeled as a copy cat. Thus blog thieves get away with the theft and thrive while the original website maybe penalized. Is that fair? Definitely not.<br />
Here is a few things you can do to detect plagiarism and protect your website from automatic scrapers who make up for the majority of content theft.</p>
<p>1. <strong>Submit your url to <a title="Copyscape" href="http://www.copyscape.com/" target="_blank">Copyscape</a> to see who is stealing your website or blog content.</strong></p>
<p>2. <strong>If content thieves don’t show up immediately in the search results, try to repeat your search a few times with omitted results.</strong></p>
<p>3. <strong>Set up Google  Alerts for key phrases related to your content.</strong></p>
<p>4. <strong>Set the feeds to “summary” which will give the scrapers only an extract from the entire content as opposed to the entire post.</strong></p>
<p>5. <strong>Add legal disclaimer in the header or the footer of the post.</strong></p>
<p>6. <strong>Use tools such as <a title="Numly WordPress Plugin" href="http://www.numly.com/numly/wordpress.asp" target="_blank">Numly WordPress Plugin</a>,  <a title="RSS Footer" href="http://yoast.com/wordpress/rss-footer/ " target="_blank">RSS Footer</a>, and <a title="TagRight" href="http://www.tagright.com/compsys.php?name=Home" target="_blank">TagRight</a> to restrict the access to your content and establish your authorship.</strong></p>
<p>7. <strong>Force blog scrapers to link back to your site by</strong></p>
<p>a) including links which point to specific pages on your website in the blog copy</p>
<p>b) linking to your blog post from within the feed content.Read <a title="Tips" href="http://collinlahay.com/2008/01/11/protect-yourself-from-content-theft/" target="_blank">tips</a> from Matt Cutt on how to do it.</p>
<p>c) using the free tool called <a title="Tynt Tracer" href="http://www.tynt.com/" target="_self">Tynt Tracer</a> (it inserts a short piece of code on your blog to prevent scraping without backlinks)</p>
<p>Now the bad news. There is little protection against the manual scraping. Combating this is a time consuming process which doesn’t always lead to the desirable outcome. If the entire text is copied without referencing to you, it is still legally enforceable under the copyright laws, however, if only parts of the content are stolen, your actions are very limited. The common suggestion is to put your craving for justice aside for a moment and assess the damage as to whether it is worth your time to go after the thieves. If the damage is tangible, you can</p>
<p>• <strong>send them an e-mail and request to take the content down</strong><br />
• <strong>flag them on the website</strong><br />
• <strong>modify RSS feed for the offenders or even block your IP address</strong><br />
• <strong>inform search engines through DCMA (Digital Millennium Copyright Act) letters</strong><br />
•<strong> seek legal action ( in extreme cases)</strong></p>
<p>Due to the efforts of our <a title="SEO" href="http://www.yellowseo.com/business_seo_services/" target="_self">SEO</a> team, the blog entry was back into the search results and the copycats were pushed far down. It was a good learning experience but it also made me think that blog scraping is a small drop in the ocean of plagiarism. Let’s face it. Individuals as well as companies bluntly rip off entire websites and business ideas every day. The least Google and other search engines can do is develop better ways of detecting and combating content theft.<br />
P.S. If this blog gets scraped, wouldn’t that be awesomely ironic?</p>
<img src="http://www.yellowseo.com/blog/?ak_action=api_record_view&id=177&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://www.yellowseo.com/blog/general-musings/stop-blog-scrapers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
