79 items found

Types: SoBigData.eu: Dataset

Filter Results
  • SoBigData.eu: Dataset

    Brexit Tweets Linked Domains

    In this spreadsheet we share domains linked in the UK EU membership referendum tweet collection. Counts for links by leave voters and remain voters are given, enabling sites...
    • The resource: 'Brexit Tweets Linked ...' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Wikipedia Word Embeddings

    Embeddings were created through applying word2vec skipgram to a corpus of wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0...
    • The resource: 'Embeddings' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Word Sense Evolution Testset

    This testset consists of 23 terms which have experienced word sense change during the past centuries. The main changes for each term were found using Wikipedia, dictionary.com...
    • ZIP
      The resource: 'WSE-testset.zip' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Emergency Tweets 2016 Amatrice earthquake

    This dataset contais Italian tweets related to the earthquake of 2016 in the Centre of Italy (https://it.wikipedia.org/wiki/Terremoto_del_Centro_Italia_del_2016_e_del_2017). is...
    • ZIP
      The resource: 'EAQ-AMA.zip' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    European Banks Asset Class exposures

    This is a curated dataset, where the Original data are taken from European Banking Authority (EBA), who collects banks' data to perform stress-test systemic risk analysis....
    • The resource: 'data-link' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Retail Market Data

    This dataset contains Retail Market Data about food products, from 2007, for about 130 shops of an Italian Distribution chain. Data are of about 1 M of Active Clients, and...
  • SoBigData.eu: Dataset

    DBLP Network

    The DBLP computer science bibliography provides a comprehensive list of research papers in computer science. This dataset is a co-authorship network constructed upon the DBLP...
    • HTML
      The resource: 'DBLP Network' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Facebook EuroSys 2009

    This dataset contains Social and interaction graphs representing two large-scale Facebook regional networks. Social graphs describe Facebook friendships between users...
    • The resource: 'The Facebook EuroSys'09 ...' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    NYSE transactions

    This dataset contains financial data on the price of the top 250 most liquid assets of New York Stock Exchange (NYSE) from 2006 to 2014. The dataset contains transactions,...
  • SoBigData.eu: Dataset

    Emergency Tweets 2013 Sardinia flood

    This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
    • ZIP
      The resource: 'FLO-SAR.zip' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    IMDB Network

    Network built upon the entire IMDb database
    • The resource: 'API interfaces' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    City-to-city migration

    Census data recording the migration of people between metropolitan areas in the US
  • SoBigData.eu: Dataset

    Facebook Wallpost

    Online interactions between users via the wall feature in the New Orleans regional network.
    • HTML
      The resource: 'Original data' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    FED data

    March 2001- September 2013 quarterly data of US banks' holdings. The number of financial institutions present in the data is pretty stable during quarters, starting from...
  • SoBigData.eu: Dataset

    bond yield_equity log-returns_CDS spreads

    Financial data used to construct a bipartite network of systemically important banks and sovereign bonds.
  • SoBigData.eu: Dataset

    WEIBO interactions

    This dataset is obtained from the 2012 WISE Challenge: built upon the logs of the popular Chinese micro-blog service WEIBO, its interactions represent mentions of users in...
    • HTML
      The resource: 'Original data' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Sheffield NERD Tweet Corpus

    The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...
    • FINF
      The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • SoBigData.eu: Dataset

    Emergency Tweets 2009 L'Aquila earthquake

    This dataset comprises 1,100 Italian tweets shared in the aftermath of the 2009 L’Aquila earthquake (https://en.wikipedia.org/wiki/2009_L%27Aquila_earthquake). The earthquake...
    • ZIP
      The resource: 'EAQ-LAQ.zip' is not accessible as guest user. You must login to access it!
  • SoBigData.eu: Dataset

    Call Data Record Pisa 2012

    The dataset contains mobile phone records collected in the province of Pisa in February 2012. It contains about 8 mln of Call Data Records (CDRs) of about 230.000 phone users,...
You can also access this registry using the API (see API Docs).