| CommonCrawl

Posted on by Brandon Klein

Common Crawl is a non-profit foundation dedicated to providing an open repository of web crawl data that can be accessed and analyzed by everyone.


Check out the new hyperlink graph analysis of the 2012

Common Crawl corpus by Web Data Commons!

The talented team at Web Data Commons extracted and analyzed the hyperlink graph

within the 2012 Common Crawl corpus. You can see the results on their website.