approved
Cherenkov Telescope Data for Ordinal Quantification

This labeled data set is targeted at ordinal quantification. It appears in our research paper "Ordinal Quantification Through Regularization", which we have published at ECML-PKDD 2022. The goal of quantification is not to predict the label of each individual instance, but the distribution of labels in unlabeled sets of data. With the scripts provided, you can extract the relevant features and labels from the public data set of the FACT Cherenkov telescope. These features are precisely the ones that domain experts from astro-particle physics employ in their analyses. The labels stem from a binning of a continuous energy label, which is common practice in these analyses. We complement this data set with the indices of data items that appear in each sample of our evaluation. Hence, you can precisely replicate our samples by drawing the specified data items. The indices stem from two evaluation protocols that are well suited for ordinal quantification. To this end, each row in the files app_val_indices.csv, app_tst_indices.csv, app-oq_val_indices.csv, and app-oq_tst_indices.csv represents one sample. Our first protocol is the artificial prevalence protocol (APP), where all possible distributions of labels are drawn with an equal probability. The second protocol, APP-OQ, is a variant thereof, where only the smoothest 20% of all APP samples are considered. This variant is targeted at ordinal quantification tasks, where classes are ordered and a similarity of neighboring classes can be assumed. The labels of the FACT data lie on an ordinal scale and, hence, pose such an ordinal quantification task.

Tags
Data and Resources
To access the resources you must log in
  • Zenodo

    Link to the site containing the data to download

    The resource: 'Zenodo' is not accessible as guest user. You must login to access it!
Personal Data Attributes

Description: Personal Data related Information

Field Value
Anonymised Pseudo Anonymized
ChildrenData No
Cross Border Authorised Yes
Data Protection Impact Assessment Yes
Ethics Committee Approval Yes
General Data Yes
Informed Consent Template Yes
Personal Data No
Personal data was manifestly made public by the data subject No
Sensitive Data No
Additional Info
Field Value
Accessibility Both
Accessibility Mode Download
Availability On-Line
Basic rights Download
Creation Date 2022-09-18
Creator Bunse, Mirko, mirko.bunse@cs.tu-dortmund.de, orcid.org/0000-0002-5515-6278
Dataset Citation Bunse, Mirko, Moreo, Alejandro, Sebastiani, Fabrizio, & Senz, Martin. (2022). Cherenkov Telescope Data for Ordinal Quantification (v0.1.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.7090095
Dataset Re-Use Safeguards None
DiskSize 23.4
External Identifier 10.5281/zenodo.7090095
Field/Scope of use Non-commercial research only
Format zip
Group Others
Language eng, English
License term 2022-09-18 /2032-09-18
Manifestation Type Virtual
Processing Degree Secondary
Retention Period 2022-09-18 /2032-09-18
Size 23.4 MB
Sublicense rights No
Territory of use World Wide
Thematic Cluster Other
system:type Dataset
Management Info
Field Value
Author Moreo Alejandro
Maintainer Moreo Alejandro
Version 1
Last Updated 17 June 2023, 08:23 (CEST)
Created 17 February 2023, 16:25 (CET)