There's a lot going on here. Clicking on a seed opens a new table showing the domains harvested as part of that seed. There are still some things to be ironed out about what is and isn't a domain, so "ruedinger.dk" and "www.ruedinger.dk" both appear as separate entries. DNS lookups are treated as a separate domain. The URL counts for all the domains should add up to the URL count for the corresponding seed, and the percentages should add up to 100%. The two duplicate-related columns show that all 4459 duplicates found came from the domain ruedinger.dk. That is to say, of the 5.15% of the total data that was duplicated, 100% came from that domain.
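The arithmetic behind the per-domain table can be sketched as follows. This is only an illustration, not the actual reporting code: the `DomainRow` type, the `summarise` function, the `dns:` row label and all URL counts are assumptions made up for the example. Only the duplicate count of 4459 for ruedinger.dk comes from the table described above.

```python
from dataclasses import dataclass


@dataclass
class DomainRow:
    """One row of the per-domain table (hypothetical structure)."""
    domain: str
    urls: int
    duplicates: int


def summarise(rows):
    """Return seed-level totals and each domain's percentage shares."""
    total_urls = sum(r.urls for r in rows)
    total_dups = sum(r.duplicates for r in rows)
    shares = {}
    for r in rows:
        # Share of the seed's URLs that belong to this domain.
        url_pct = 100.0 * r.urls / total_urls if total_urls else 0.0
        # Share of the seed's duplicates that came from this domain.
        dup_pct = 100.0 * r.duplicates / total_dups if total_dups else 0.0
        shares[r.domain] = (url_pct, dup_pct)
    return total_urls, total_dups, shares


# URL counts below are invented placeholders; 4459 is the duplicate
# count reported for ruedinger.dk.
rows = [
    DomainRow("ruedinger.dk", 6000, 4459),
    DomainRow("www.ruedinger.dk", 3000, 0),
    DomainRow("dns:", 2, 0),  # DNS lookups counted as their own "domain"
]

total_urls, total_dups, shares = summarise(rows)
for domain, (url_pct, dup_pct) in shares.items():
    print(f"{domain}: {url_pct:.2f}% of URLs, {dup_pct:.2f}% of duplicates")
```

The per-domain URL counts sum to the seed total by construction, so the URL percentages sum to 100%. Because every duplicate belongs to ruedinger.dk, its share of the duplicates is 100%, regardless of how large a fraction of the total data the duplicates represent.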