59 items found

Formats: ZIP

  • Dataset

    Reddit Echo Chamber dataset

    In a digital environment, the term echo chamber refers to an alarming phenomenon in which beliefs are amplified or reinforced by communication repetition inside a closed...
    • ZIP
      The resource: 'Reddit Echochamber' is not accessible as guest user. You must login to access it!
  • Dataset

    Fire smoke detection dataset

    Dataset of fire, non fire, and smoke images
    • ZIP
      The resource: 'Ilenia Ficili' is not accessible as guest user. You must login to access it!
  • Dataset

    Weather and Pollution in Smart Cities

    A set of weather and climatic conditions gathered during the Toolsmart PoN project (Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
    • ZIP
      The resource: 'Weather and Pollution in ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Spotify track dataset (small)

    The dataset was created using the Spotify API and the track IDs provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...
    • ZIP
      The resource: 'std_small' is not accessible as guest user. You must login to access it!
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
  • Dataset

    DNA 31-mers

    A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
      The resource: 'DNA 31-mers' is not accessible as guest user. You must login to access it!
  • Dataset

    Private Smart Cities Weather and Pollution conditions

    A set of weather and climatic conditions gathered during the Toolsmart PoN project (Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
  • Dataset

    Compounds with Activity against the Dopamine D2 Receptor

    Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...
    • ZIP
      The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
  • Dataset

    GiveMeSomeCreditSC

    The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...
    • ZIP
      The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
  • Dataset

    DNA 12-mers

    A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
      The resource: 'DNA 12-mers' is not accessible as guest user. You must login to access it!
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225,501 tweets written by 141,277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
      The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Mobility Purpose Dataset

    A synthetically generated dataset representing purpose-of-motion data in the format of individual mobility networks.
    • ZIP
      The resource: 'Synthetic purpose of ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Shopping retail synthetic dataset (CopulaGAN)

    Synthetic shopping retail consumption data generated with CopulaGAN. The dataset provides monthly information on the spending of synthetic customers belonging to two classes...
    • ZIP
      The resource: 'Shopping retail synthetic ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Private Identified CNVs from whole exome sequencing data of BRCA1/2 negative breast c...

    This dataset offers a comprehensive analysis of Copy Number Variations (CNVs) identified in Whole Exome Sequencing (WES) data from patients with breast cancer who tested...
  • Dataset

    Twitter Newcomers Dataset

    Twitter accounts detected right after registration and monitored for 21 days
    • ZIP
      The resource: 'New Accounts Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Medical Dataset

    The medical dataset contains a corpus of fully anonymized clinical text. Each document in the corpus is associated with a set of ICD-9 codes which represents the diagnosis...
    • ZIP
      The resource: 'Medical Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Emergency Tweets 2016 Amatrice earthquake

    This dataset contains Italian tweets related to the 2016 earthquake in central Italy (https://it.wikipedia.org/wiki/Terremoto_del_Centro_Italia_del_2016_e_d...). is...
    • ZIP
      The resource: 'EAQ-AMA.zip' is not accessible as guest user. You must login to access it!
  • Dataset

    Emergency Tweets 2013 Sardinia flood

    This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
    • ZIP
      The resource: 'FLO-SAR.zip' is not accessible as guest user. You must login to access it!
  • Dataset

    Emergency Tweets 2009 L'Aquila earthquake

    This dataset comprises 1,100 Italian tweets shared in the aftermath of the 2009 L’Aquila earthquake (https://en.wikipedia.org/wiki/2009_L%27Aquila_earthquake). The earthquake...
    • ZIP
      The resource: 'EAQ-LAQ.zip' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'geo-annotated tweets.zip' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).