Sheffield NERD Tweet Corpus

The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes from financial institutions and news outlets. The rest are random tweets collected in 2014 by using keywords related to the climate change topic.Tweets are in GATE FastInfoset format.

Data and Resources
To access the resources you must log in
  • Sheffield NERD Tweet CorpusFINF

    794 tweets annotated with named entities disambiguated against DBpedia, and...

    The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Both
AccessibilityMode OnLine Access
AccessibilityMode Download
Availability On-Line
Basic rights Temporary download of a single copy only
Basic rights Download
Basic rights Copying
Basic rights Distribution
Basic rights Modification
Basic rights Communication
Basic rights Making available to the public
ChildrenData No
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
CreationDate 2015-05-23
Creator Gorrell, Genevieve
DataProtectionDirective Data Protection Act 1998
Field/Scope of use Non-commercial only
Language eng, English
License term /Not specified
ManifestationType Virtual
Personal data was manifestly made public by the data subject Yes
PersonalData No
PersonalSensitiveData Select PersonalSensitiveData
ProcessingDegree Secondary
Restrictions on use Non-commercial, share under the same license.
Sublicense rights No
Territory of use World Wide
ThematicCluster Text and Social Media Mining
TimeCoverage 2013-01-01 /2014-12-31
system:type Dataset
Management Info
Field Value
Author Genevieve Gorrell, Johann Petrak, Kalina Bontcheva
Maintainer Gorrell Genevieve
Version 1
Last Updated 22 December 2020, 17:53 (CET)
Created 29 June 2018, 11:34 (CEST)