459 items found

Organisations: SoBigData Services and Products

Filter Results
  • Dataset

    The subTHz regime, first Results on channel measurement: 75-110 GHz (W-band)

    The measurements have been conducted using a Keysight PNA Vector Analyzer connected to a pair of VDI Extenders for the frequency bands 75-110 GHz (W-band). IF bandwidth has...
    • s2p
      The resource: '75-110 GHz 30 cm' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '75-110 GHz 60 cm' is not accessible as guest user. You must login to access it!
    • s2p
      The resource: '75-110 GHz 90 cm' is not accessible as guest user. You must login to access it!
  • Dataset

    Wi-Fi channel frequency response database for contactless human activity reco...

    This database collects the channel frequency response (CFR) vectors captured through the Nexmon CSI extraction tool from an Asus RT-AC86U IEEE 802.11ac Wi-Fi router working with...
    • The resource: 'Wi-Fi channel frequency ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Spotify Tracks Dataset (full)

    The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...
    • The resource: 'std_full' is not accessible as guest user. You must login to access it!
  • Dataset

    Spotify track dataset (small)

    The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...
    • ZIP
      The resource: 'std_small' is not accessible as guest user. You must login to access it!
  • Dataset

    Air Quality Datasets over L'Aquila Region

    These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
  • Dataset

    DNA 31-mers

    A 12 GB dataset containing all the ~367M unique 31-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
      The resource: 'DNA 31-mers' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Smart Cities Weather and Pollution conditions

    A set of weather and climatic conditions gathered during the Toolsmart PoN project ( Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
  • Dataset

    Compounds with Activity against the Dopamine D2 Receptor

    Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...
    • ZIP
      The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
  • Method

    Reducing radicalizism in social networks by feeds prioritization - Rebalancin...

    Code and description of the methodology of the paper "Rebalancing Social Feed to Minimize Polarization and Disagreement" funded by SoBigData ++
  • Dataset

    Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...

    Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...
    • CSV
      The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
  • Dataset

    GiveMeSomeCreditSC

    The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...
    • ZIP
      The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Dataset for Causal Analysis

    The dataset is a synthetic version of the well-known German Credit dataset (https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data). It includes variables such as...
    • CSV
      The resource: 'synthetic german data' is not accessible as guest user. You must login to access it!
  • Dataset

    DNA 12-mers

    A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
    • ZIP
      The resource: 'DNA 12-mers' is not accessible as guest user. You must login to access it!
  • Dataset

    FANCY Dataset

    (NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,...
    • The resource: 'FANCY Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225.501 tweets written by 141.277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
      The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    World Trade Web_2020

    Weighted, directed adjacency matrix of the World Trade Web in the year 2020
    • CSV
      The resource: 'WTN_adj_2020' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Post-earthquake Reconstruction Progress Datasets over L'Aquila City

    Reconstruction data sets, provided by the National Public Entities of USRA and USRC. These data sets are stored in CSV files and provide comprehensive information related to...
  • Dataset

    Carbon Trade Network_2020

    Weighted, directed adjacency matrix of the Carbon Trade Network in the year 2020
    • CSV
      The resource: 'CTN_adj_2020' is not accessible as guest user. You must login to access it!
  • Dataset

    Carbon Trade Network_2000

    Weighted, directed adjacency matrix of the Carbon Trade Network in the year 2000
    • CSV
      The resource: 'CTN_adj_2000' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).