Content Recycling / Content Scraping
Scraping makes a whole lot of sense while dealing with content recycling problem. By content recycling, I mean here republishing content which is no longer available / searchable on Google which has been removed from Google index. The procedure is to pick up websites from directories as archives.org , checking it on Google index and if it does not exist, publishing it somewhere and submitting to Google’s index (by lot of ways we know). Content scraping is otherwise, dealt as duplicate content penalty from Google if the index check is not done and content already exists on Google.
The problem given is to algorithmically write content for Search Engine Optimization of a travel portal (name confidential). Read the rest of this entry »
