-
Municipality Transition index in Spain
Computation of the Municipality Transition Index (MTI) in Spain. Data are collected according to the method described in according to the paper: Alessio Muscillo, Simona Re,...-
XLSX
The resource: 'Municipality Transition ...' is not accessible as guest user. You must login to access it!
-
XLSX
-
The subTHz regime, first Results on channel measurement: 500-750 GHz
The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands 500-750 GHz (W-band). IF bandwidth has... -
The subTHz regime, first Results on channel measurement: : 170-260 GHz (nearl...
The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands) 170-260 GHz (nearly G-band). IF... -
The subTHz regime, first Results on channel measurement: 75-110 GHz (W-band)
The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands 75-110 GHz (W-band). IF bandwidth has... -
Wi-Fi channel frequency response database for contactless human activity reco...
This database collects the channel frequency response (CFR) vectors captured through the Nexmon CSI extraction tool from an Asus RT-AC86U IEEE 802.11ac Wi-Fi router working with... -
Spotify Tracks Dataset (full)
The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The... -
Spotify track dataset (small)
The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...-
ZIP
The resource: 'std_small' is not accessible as guest user. You must login to access it!
-
ZIP
-
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData. -
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...-
ZIP
The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
-
ZIP
-
DNA 31-mers
A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...-
ZIP
The resource: 'DNA 31-mers' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Smart Cities Weather and Pollution conditions
A set of weather and climatic conditions gathered during the Toolsmart PoN project ( Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based... -
Compounds with Activity against the Dopamine D2 Receptor
Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...-
ZIP
The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
-
ZIP
-
Gene Disease Association Data and Features
This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...-
RAR
The resource: 'Gene_Disease_Association_Da ...' is not accessible as guest user. You must login to access it!
-
RAR
-
Dataset of generic users interactions in Twitter, to evaluate polarity of rel...
A dataset collecting Twitter interactions between generic users, to evaluate the polarity of their social relationships at the level of ego networks. Generic users serve as a...-
TSV
The resource: 'Baseline' is not accessible as guest user. You must login to access it!
-
TSV
-
Reducing radicalizism in social networks by feeds prioritization - Rebalancin...
Code and description of the methodology of the paper "Rebalancing Social Feed to Minimize Polarization and Disagreement" funded by SoBigData ++ -
Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...
Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...-
CSV
The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
-
CSV
-
GiveMeSomeCreditSC
The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...-
ZIP
The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
-
ZIP
-
Synthetic Dataset for Causal Analysis
The dataset is a synthetic version of the well-known German Credit dataset (https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data). It includes variables such as...-
CSV
The resource: 'synthetic german data' is not accessible as guest user. You must login to access it!
-
CSV
-
-
CSV
The resource: 'World Trade Web_2000' is not accessible as guest user. You must login to access it!
-
CSV
-
DNA 12-mers
A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...-
ZIP
The resource: 'DNA 12-mers' is not accessible as guest user. You must login to access it!
-
ZIP