[IPv6crawler-wg] In search of domain names - found!
Olivier MJ Crepin-Leblond
ocl at gih.com
Mon Mar 22 08:00:24 GMT 2010
Good news!
Looking through the Web last night, I came across a link to download a
CSV file of Alexa's 1 million (yes, you're reading this correctly:
1 000 000) most popular Web sites. This gives us a great sample of
domain names to test. We can select "UK universities" by doing an fgrep
on ".ac.uk" and get 400 results.
We can select "UK government Web sites" and get 453 domains.
Egyptian sites ending in ".eg" give 175 sites, and so on.
These are great samples!
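
For anyone who would rather script the selection than run fgrep by hand,
here is a minimal sketch in Python. It assumes each line of the CSV has
the form "rank,domain"; the function name select_by_suffix and the sample
rows are made up for illustration and are not taken from the real file.
Note that it matches on the end of the domain name, which is slightly
stricter than an fgrep on the same string (fgrep matches it anywhere in
the line), so the counts may differ a little.

```python
import csv
import io


def select_by_suffix(lines, suffix):
    """Return the domains that end with `suffix`.

    `lines` is any iterable of CSV lines, assumed to be "rank,domain".
    """
    suffix = suffix.lower()
    matches = []
    for row in csv.reader(lines):
        if len(row) < 2:
            continue  # skip blank or malformed lines
        domain = row[1].strip().lower()
        if domain.endswith(suffix):
            matches.append(domain)
    return matches


# Made-up sample rows standing in for the real file.
sample = io.StringIO(
    "1,google.com\n"
    "2,ox.ac.uk\n"
    "3,direct.gov.uk\n"
    "4,cam.ac.uk\n"
    "5,ahram.org.eg\n"
)
uk_universities = select_by_suffix(sample, ".ac.uk")
print(len(uk_universities), uk_universities)
```

To run it on the real data, pass an open file handle in place of the
sample, e.g. open(path, newline=""), with path pointing at the CSV.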
The file is available in my sub-directory /home/ocl/data on
crawler.ipv6matrix.org
Apparently it gets updated daily on the Alexa Web site, but I don't
think that we need to update ours more than once every few months.
Warm regards,
Olivier
--
Olivier MJ Crépin-Leblond, PhD
http://www.gih.com/ocl.html