47 items found

Types: SoBigData.eu: Dataset

  • SoBigData.eu: Dataset

    ISTAT Census zone Tuscany

    Geometry of census sectors and limited demographic information. Number of sectors: about 20,000.
  • SoBigData.eu: Dataset

    Private e-MID dataset

    e-MID is the Italian Electronic Market for Interbank Deposits. The data consist of transactions between banks participating in the market. For each transaction, the following...
  • SoBigData.eu: Dataset

    Flickr and Wikipedia Tourism Trajectories

    The dataset contains a knowledge base built with data coming from Flickr and Wikipedia. It covers three Italian cities which are important from a sightseeing point of view and...
  • SoBigData.eu: Dataset

    CoPhIR

    The CoPhIR (Content-based Photo Image Retrieval) Test-Collection has been developed to make significant tests on the scalability of the SAPIR project infrastructure (SAPIR:...
  • SoBigData.eu: Dataset

    Private Disease Twitter Dataset

    This Twitter dataset covers two recent outbreaks: Ebola and Zika. About 60 million tweets were collected through query-based access to the Twitter Streaming API, covering...
  • SoBigData.eu: Dataset

    Private Twitter Dataset 2013-2014

    The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st...
  • SoBigData.eu: Dataset

    Scientific Publications Dataset

    SciMAG 2015 dataset including publication and citation data
  • SoBigData.eu: Dataset

    Private e-MID interbank transactions

    Edge list containing daily interbank transactions as registered in the electronic Market for Interbank Deposits (e-MID), in the period 2010-2014. e-MID is one of the largest...
  • SoBigData.eu: Dataset

    Word Sense Evolution Testset

    This testset consists of 23 terms which have experienced word sense change during the past centuries. The main changes for each term were found using Wikipedia, dictionary.com...
  • SoBigData.eu: Dataset

    Social Network dataset - LiveJournal

    LiveJournal is a free online blogging community where users declare friendship with each other. LiveJournal also allows users to form a group which other members can then join. We...
  • SoBigData.eu: Dataset

    Private Russell 3000 stock prices

    Data on the price and volume of the 3000 stocks belonging to the Russell 3000 Index, roughly corresponding to the 3000 most capitalized stocks. Traded volume and high, low,...
  • SoBigData.eu: Dataset

    Private Retail market dataset

    The dataset contains purchases by Unicoop Tirreno customers, along with descriptive information about the shops (both small shops and supermarkets) and the customers.
  • SoBigData.eu: Dataset

    Private Retail Market Data

    Retail market data about food products, from 2007, for stores of an Italian distribution chain. About 130 shops, about 1M active clients, 450K different products, 280M...
  • SoBigData.eu: Dataset

    Official administrative information of Tuscany

    The data contain the spatial partitioning of Tuscany and some statistical information published by the Italian statistical bureau.
  • SoBigData.eu: Dataset

    Private NYSE transactions

    Financial data on the price of the 250 most liquid assets of the New York Stock Exchange (NYSE) from 2006 to 2014. The dataset contains transactions, quotes and market depth...
  • SoBigData.eu: Dataset

    Private MSN Search query log

    The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)...
  • SoBigData.eu: Dataset

    Private HPC Twitter Dumps

    The dataset consists of 10% of the daily stream of tweets produced on Twitter, filtered into 3 subsets: English, Italian, geo-referenced. The tweets are a random sample of...
  • SoBigData.eu: Dataset

    GPS Origin Destination Matrix in Tuscany

    Extraction of the origin and destination matrix among the municipalities of Tuscany
  • SoBigData.eu: Dataset

    GERDAQ Dataset

    This is a benchmark dataset of annotated search-engine queries. Mentions of entities in search-engine queries are tagged with the entity they refer to. Wikipedia is used as...
  • SoBigData.eu: Dataset

    Private German Academic Web

    Regular crawls of the websites of German academic institutions
You can also access this registry using the API (see API Docs).