Datasets | Center for Complex Networks and Systems Research

Posted on by Brandon Klein

Metadata for the complete set of all PubMed records through 2012 (with part of 2013 available as well), including title, authors, and year of publication. All data provided originates from NLM’s PubMed database (as downloaded April 24, 2013 from the NLM FTP site) and was retrieved via the Scholarly Database.


A collection of bookmarks from for the month of November 2009. (More on GiveALink project)


  • Dataset size: 22.5 mil tweets
  • File size: 2.5GB uncompressed
  • Time Period: Nov 1, 2012 – Nov 30, 2012

Data: tweets-nov-2012.tgz (707MB)