Fighting scraper sites… (aka plagiarism, aka stealing, aka duplicate content)

{ scraping by Greg via CC }

We’ve had some discussions on Mahalo recently about fighting scraped content (aka plagiarism, aka stolen content, aka scraper sites, aka duplicate content). Today our team put together a best practice for this, along with a tool and a short video: http://greenhouse.mahalo.com/Plagiarism_Tool

We can never beat all the stolen/scraped content out there, but we can certainly do a better job than the machine search engines.







Hello. My name is Jason.
I'm the CEO of Mahalo.com, a human-powered search engine. I was previously the co-founder of Weblogs, Inc. with Brian Alvey, and the GM of Netscape.

I'm currently on the board of social shopping site ThisNext. You might remember me from my days as editor and CEO of the Silicon Alley Reporter magazine.

Mike Arrington and I partnered on the TechCrunch40 event in September. We're going to do it again next year.

This is my blog, this is where I live. You should also listen to my podcast.

