FAQ

“NetworkCartographer” is an academic project of the UAS Niederrhein (FB03 - Electrical Engineering and Computer Science) in Krefeld, Germany.

The crawler collects web domains together with the links from each domain to other web domains.

The following information is stored about the crawled domains:

  • Domain (host, protocol, port)
  • IP address of the server
  • Timestamps (first crawl, last crawl)

The following information is stored about the linked domains:

  • Number of distinct links
  • Number of responses per HTTP status code class (3XX, 4XX, 5XX, 9XX)
  • Timestamps (first crawl, last crawl)
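
To make the two lists above concrete, here is a minimal sketch of how such records could be modelled. It is an illustration only, not the project's actual schema: the class names, field names, and the choice of a dictionary for the status counts are all assumptions.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class CrawledDomain:
    """One crawled domain (hypothetical field names)."""
    host: str
    protocol: str
    port: int
    ip_address: str
    first_crawl: datetime
    last_crawl: datetime


@dataclass
class LinkedDomain:
    """Aggregated data about links from one domain to another (hypothetical)."""
    source_host: str
    target_host: str
    distinct_links: int
    first_crawl: datetime
    last_crawl: datetime
    # Counters per status class; the class labels are the ones listed above.
    status_counts: dict = field(
        default_factory=lambda: {"3XX": 0, "4XX": 0, "5XX": 0, "9XX": 0}
    )


now = datetime(2024, 1, 1, 12, 0, 0)
domain = CrawledDomain("example.org", "https", 443, "192.0.2.10", now, now)
link = LinkedDomain("example.org", "example.com", 3, now, now)
link.status_counts["4XX"] += 1
```

Keeping the counters in a dictionary keyed by status class avoids one field per class, but a real schema may well use separate columns.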

No concrete use case has been implemented yet. The data is collected to test the system under development and to support future research projects.

The data is stored on an internal university server and is only accessible to authorized persons.

This group consists only of the project managers and the server administrator.

The crawler takes the rules in “/robots.txt” into account.

The crawler can be blocked via an entry in “/robots.txt”: add a rule group for the “NetworkCartographer” user agent.
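
As an illustration, the sketch below embeds an example “/robots.txt” that blocks the whole site for the “NetworkCartographer” user agent and checks it with Python's standard `urllib.robotparser`. The URL `https://example.org/` and the second agent name `SomeOtherBot` are placeholders, and the check shows how Python's parser reads the rules, which may differ in detail from how the crawler itself evaluates them.

```python
from urllib.robotparser import RobotFileParser

# Example robots.txt content: one rule group for the crawler's user agent.
ROBOTS_TXT = """\
User-agent: NetworkCartographer
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Placeholder URL and agent names, used only for this check.
crawler_allowed = is_allowed(ROBOTS_TXT, "NetworkCartographer", "https://example.org/index.html")
other_allowed = is_allowed(ROBOTS_TXT, "SomeOtherBot", "https://example.org/index.html")
print(crawler_allowed, other_allowed)
```

To block only part of a site, replace `Disallow: /` with the path prefix that should be excluded, for example `Disallow: /private/`.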