Wikinews dataset

This dataset consists of a sample of 365 news published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency score, by the Wikinews community.

Tags
Data and Resources
To access the resources you must log in
  • entity-saliencyJSON

    The resource: 'entity-saliency' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Area
Attribution requirements
Availability On-Line
Basic rights Download
ChildrenData No
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
CreationDate 2016-11-04
Creator Trani, Salvatore, salvatore.trani@istu.cnr.it
DataProtectionDirective none
DiskSize
Display requirements
Distribution requirements
External Identifier
Field/Scope of use Any use
Format
FormatSchema
IP/Copyrights
Language eng, English
License term /Not specified
ManifestationType Virtual
Personal data was manifestly made public by the data subject No
PersonalData No
PersonalSensitiveData Select PersonalSensitiveData
ProcessingDegree Primary
RelatedPaper https://dl.acm.org/citation.cfm?doid=2960811.2960819
Requirement of non-disclosure (confidentiality mark)
Restrictions on use
Semantic Coverage
Size
Sublicense rights No
Territory of use World Wide
ThematicCluster Text and Social Media Mining
TimeCoverage 2004-01-01 /2014-12-31
spatial
system:type SoBigData.eu: Dataset
Management Info
Field Value
Author Ferragina Paolo
Maintainer Ferragina Paolo
Version 1
Last Updated 29 June 2018, 11:33 (CEST)
Created 29 June 2018, 11:33 (CEST)