84 items found

Licenses: Academic Free License 3.0; Availability: On-Line; Groups: sobigdata-eu

  • Dataset

    Common Crawl Financial News Dataset

    This dataset contains financial articles related to companies in the S&P500 index for the period from September 2016 to February 2020. The articles were extracted from the...
    • CSV
      Resource: 'Common_Crawl_Financial_News' (login required)
  • Dataset

    Multi-sensor dataset of environmental office room conditions

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...
    • RAR
      Resource: 'IoT_dataset_smart_office' (login required)
  • Dataset

    RAN and NWDAF data from Cellular Network in Catania

    Dataset containing various RAN and UEs metrics collected from 4 BSs deployed at Piazza D'Uomo, Catania. Metrics can be used for machine learning-based studies for physical...
    • Resource: 'Dataset' (login required)
  • Dataset

    The Hackernews dataset

    This corpus has been extracted from The Hacker News website (https://thehackernews.com), a CS news platform that attracts over 8 million readers monthly, which is daily...
    • Resource: 'the-hackenews' (login required)
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225,501 tweets written by 141,277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
      Resource: 'tweet_santorini.csv' (login required)
  • Dataset

    Kinematic Features of Porto Taxi Trips

    This dataset comprises tabular information related to the movement of taxis in the city of Porto, Portugal. For every taxi journey, we segmented the trajectory into 20...
    • CSV
      Resource: 'TIF - taxi trajectory data' (login required)
  • Dataset

    Semantic Networks from news articles (Dutch sample)

    The Semantic Networks from news articles (Dutch sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Dutch_sampleNet_anonymized' (login required)
  • Dataset

    Semantic Networks from news articles (Portuguese sample)

    The Semantic Networks from news articles (Portuguese sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      Resource: 'Portuguese_sampleNet_anonym ...' (login required)
  • Dataset

    Yeast

    The yeast dataset is a collection of yeast microarray expressions and phylogenetic profiles which can be used to learn the yeast gene functional categories. One row of this...
    • arff
      Resource: 'Yeast Dataset' (login required)
  • Dataset

    Mobility index for local quarantines in Chile

    Fighting the COVID-19 pandemic, most countries have implemented non-pharmaceutical interventions like wearing masks, physical distancing, lockdown, and travel restrictions....
    • CSV
      Resource: 'Mobility Index for Local ...' (login required)
  • Dataset

    Medical Dataset

    The medical dataset contains a corpus of fully anonymized clinical text. Each document in the corpus is associated with a set of ICD-9 codes which represents the diagnosis...
    • ZIP
      Resource: 'Medical Dataset' (login required)
    • CSV
      Resource: 'Churn Dataset' (login required)
  • Dataset

    German Credit

    In the German credit dataset, each of the 1,000 persons is classified as a good or bad creditor according to attributes like age, sex, checking_account, credit_amount,...
    • CSV
      Resource: 'German Credit' (login required)
  • Dataset

    Compas

    The compas dataset contains the features used by the COMPAS algorithm for scoring defendants and their risk (Low, Medium and High), for over 4,000 individuals. We considered...
    • CSV
      Resource: 'https://www' (login required)
  • Dataset

    Dataset Adult

    The adult dataset includes 48,842 instances with demographic information like age, workclass, marital-status, race, capital-loss, capital-gain etc. The income attribute...
    • CSV
      Resource: 'Adult' (login required)
  • Dataset

    Emergency Tweets 2016 Amatrice earthquake

    This dataset contains Italian tweets related to the 2016 earthquake in central Italy (https://it.wikipedia.org/wiki/Terremoto_del_Centro_Italia_del_2016_e_d...). is...
    • ZIP
      Resource: 'EAQ-AMA.zip' (login required)
  • Dataset

    Emergency Tweets 2013 Sardinia flood

    This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
    • ZIP
      Resource: 'FLO-SAR.zip' (login required)
  • Dataset

    Emergency Tweets 2009 L'Aquila earthquake

    This dataset comprises 1,100 Italian tweets shared in the aftermath of the 2009 L’Aquila earthquake (https://en.wikipedia.org/wiki/2009_L%27Aquila_earthquake). The earthquake...
    • ZIP
      Resource: 'EAQ-LAQ.zip' (login required)
  • Dataset

    Emergency Tweets 2013 Milan blackout

    This dataset is related to a power outage (i.e., a blackout) that occurred in the city of Milan, in northern Italy, on the night between 14 and 15 May 2013. Despite not...
    • CSV
      Resource: 'PWO-MIL_tweets.csv' (login required)
  • Dataset

    Emergency Tweets 2011 Christchurch earthquake

    This dataset contains tweets related to the devastating earthquake that occurred on 22 February 2011, at around 12 p.m. local time in Christchurch, New Zealand...
    • CSV
      Resource: 'EAQ-CHR_tweets.csv' (login required)