Distributed web crawling

Distributed web crawling is a distributed computing technique whereby Internet search engines employ many computers to index the Internet via web crawling. Such systems may allow users to volunteer their own computing and bandwidth resources for crawling web pages. By spreading the load of these tasks across many computers, the cost that would otherwise go toward maintaining large computing clusters is avoided.
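To make the load-spreading concrete, here is a minimal sketch of one common way to partition crawl work: hashing each URL's hostname to assign it to a worker. The function name assign_url, the worker count, and the sample URL frontier are illustrative assumptions, not part of any particular crawler.

```python
import hashlib
from urllib.parse import urlparse

def assign_url(url: str, num_workers: int) -> int:
    """Assign a URL to a worker by hashing its hostname.

    Hashing the hostname (rather than the full URL) keeps all pages
    of one site on the same worker, so per-host politeness rules
    (crawl delays, a cached robots.txt) can be enforced locally
    without coordination between workers.
    """
    host = urlparse(url).hostname or ""
    digest = hashlib.sha256(host.encode("utf-8")).digest()
    # Fold the first 8 bytes of the digest into an integer worker index.
    return int.from_bytes(digest[:8], "big") % num_workers

# Example: partition a small, hypothetical frontier across 4 workers.
frontier = [
    "https://example.com/a",
    "https://example.com/b",
    "https://example.org/",
    "https://example.net/page",
]
for url in frontier:
    print(assign_url(url, num_workers=4), url)
```

Because the assignment is a pure function of the hostname, every machine can compute it independently: a worker that discovers a new link simply hashes it and forwards it to the responsible peer, with no central dispatcher needed.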

 
