Common Crawl - free big data sets
Everyone should have the opportunity to indulge their curiosities, analyze the world, and pursue brilliant ideas. Small startups and even individuals can now access high-quality crawl data that was previously available only to large search engine corporations.
For more information about the corpus, look at our Get Started page.
Our Google Group is an active hub where technologists collaborate and ask questions. Our Twitter feed is a great way to keep up with our latest news and thoughts, and to engage with the Common Crawl community.
We invite you to join the Open Data Bay Area Meetup, a San Francisco Bay Area group where you can meet other supporters of Open Data like yourself.