PageTraffic SEO Blog

Subscribe To Page Traffic Blog
Subscribe via RSS
Subscribe via Email

Google Gets Duplicate Content Detection Patent

January 3rd, 2007 | 101 Views RSS Feed



If you're new here, you may want to subscribe to our Full RSS feed to get a daily digest of news around search engine industry.

Google has gained an important patent from US Patent office for 'Methods and apparatus for estimating similarity'. Google had filed for it on December 31, 2001. Using this patent, the search engine giant can develop duplicate content detection tools for the webmasters.

The US Patent Office explains on the new patent, “A similarity engine generates compact representations of objects called sketches. Sketches of different objects can be compared to determine the similarity between the two objects. The sketch for an object may be generated by creating a vector corresponding to the object, where each coordinate of the vector is associated with a corresponding weight. The weight associated with each coordinate in the vector is multiplied by a predetermined hashing vector to generate a product vector, and the product vectors are summed. The similarity engine may then generate a compact representation of the object based on the summed product vector”.

A WebmasterWorld Forum thread says,"Anything they do to reduce duplicate content showing up in SERPS is a good thing. Thanks to Google for trying to help web users sick of seeing copies".

Click here to subscribe to our RSS feed to get a daily digest of news around search engine industry. PageTraffic SEO Blog is updated four times a day and is ranked as one of the best search engine resources blog by Pandia!


 


Comments

Leave a Reply

Hire Full Time SEO Consultant


Subscribe To Our SEO Blog


Enter your email address:

Delivered by FeedBurner



The Associates

SEO Blogs - Blog Catalog Blog Directory

Back to Top

Copyright © 2006-2009 PageTraffic SEO Blog. All rights reserved.

RSS feeds. WordPress Theme by Candid Software.

Feedback Form