Brandon Klein Brandon understands that better teams are fundamental to all of our success. As a global thought leader, ushering in the 'Future of Work' revolution, he paves the way using data + design to accelerate the Collaboration Revolution. Brandon is the Co-Founder of the software start-up, Collaboration.Ai and an active member of The Value Web, a non-profit committed to changing the way decisions are made to better impact our world. Mar 08

wiki-links - Wikipedia Links Data - Google Project Hosting

Google Research just launched its Wikilinks corpus, a massive new data set for developers and researchers that could make it easier to add smart disambiguation and cross-referencing to their applications. The data could, for example, make it easier to find out if two web sites are talking about the same person or concept, Google says. In total, the corpus features 40 million disambiguated mentions found within 10 million web pages. This, Google notes, makes it “over 100 times bigger than the next largest corpus,” which features fewer than 100,000 mentions.

For Google, of course, disambiguation is something that is a core feature of the Knowledge Graph project, which allows you to tell Google whether you are looking for links related to the planet, car or chemical element when you search for ‘mercury,’ for example. It takes a large corpus like this one and the ability to understand what each web page is really about to make this happen.

API, collaboration company, database, bigdata, collaborative web link, datamining, integration, nodes

Brandon Klein Brandon understands that better teams are fundamental to all of our success. As a global thought leader, ushering in the 'Future of Work' revolution, he paves the way using data + design to accelerate the Collaboration Revolution. Brandon is the Co-Founder of the software start-up, Collaboration.Ai and an active member of The Value Web, a non-profit committed to changing the way decisions are made to better impact our world.