Brandon Klein

Brandon understands that better teams are fundamental to all of our success. As a global thought leader ushering in the "Future of Work" revolution, he uses data and design to accelerate the Collaboration Revolution. Brandon is the co-founder of the software start-up Collaboration.Ai and an active member of The Value Web, a non-profit committed to changing the way decisions are made to better impact our world.

Aug 29

Teaching machines to read between the lines (and a new corpus with entity salience annotations)

Language understanding systems are largely trained on freely available data, such as the Penn Treebank, perhaps the most widely used linguistic resource ever created. We have previously released a good deal of linguistic data ourselves, both to contribute to the language understanding community and to encourage further research in these areas.

Now, we’re releasing a new dataset, based on another great resource: the New York Times Annotated Corpus, a set of 1.8 million articles spanning 20 years. Of these, 600,000 articles have hand-written summaries, and more than 1.5 million are tagged with the people, places, and organizations mentioned in the article. The Times encourages use of the metadata for all kinds of things, and has set up a forum to discuss related research.

