DepSpid

What is DepSpid?

DepSpid is a distributed kind of a web crawler. The DepSpid spider visits domains, analyses links and finally calculates scores about the link dependencies between individual domains. Each spider job starts at the main page of a domain and then follows each link on that page retrieving more pages and analysing them, too. The spider stays within one domain. If it finds an external link it only checks if the linked domain is reachable but doesn't continue crawling into the external domain. Every unknown domain will be visited from another spider job at a later time.

The DepSpid spider is currently under devlopment. Once it's running in production mode, the data collected by the spider will be publically available and will give webmasters a new kind of sight into their own or foreign domains.
(c) 2006 Bjoern Henke [Contact].