Medical Dataset

The medical dataset contains a corpus of fully anonymized clinical text. Each document in the corpus is associated with a set of ICD-9 codes which represents the diagnosis associated with the clinical report. To each report might be assigned several ICD-9 codes.

Data and Resources
To access the resources you must log in
  • Medical DatasetZIP

    The resource: 'Medical Dataset' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Attribution requirements
Availability On-Line
Basic rights Download
ChildrenData N/A (Not appliable)
Consent obtained also covers the envisaged transfer of the personal data outside the EU N/A (Not appliable)
Consent of the data subject N/A (Not appliable)
CreationDate 2019-01-23
Creator Guidotti, Riccardo,
DataProtectionDirective Kaggle Regulations
Display requirements
Distribution requirements
External Identifier
Field/Scope of use Any use
Language Select Language
License term /Not specified
ManifestationType Virtual
Personal data was manifestly made public by the data subject N/A (Not appliable)
PersonalData No
PersonalSensitiveData Select PersonalSensitiveData
ProcessingDegree Primary
Requirement of non-disclosure (confidentiality mark)
Restrictions on use
Semantic Coverage
Sublicense rights No
Territory of use World Wide
ThematicCluster Social Data
TimeCoverage 2019-01-23
system:type Dataset
Management Info
Field Value
Author Guidotti Riccardo
Maintainer Guidotti Riccardo
Version 1
Last Updated 22 December 2020, 17:47 (CET)
Created 23 January 2019, 15:39 (CET)