92 items found

Types: SoBigData.eu: Dataset

  • SoBigData.eu: Dataset

    Wikinews dataset

    This dataset consists of a sample of 365 news articles published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency...
    • JSON: 'entity-saliency' (login required)
  • SoBigData.eu: Dataset

    Open data from NervousNet

    This dataset contains anonymized proximity information sent by 154 mobile phones (both Android and iPhone) via phone apps. This information is sent by Bluetooth beacons every...
    • ZIP: 'open data from NervousNet' (login required)
  • SoBigData.eu: Dataset

    GPS Origin Destination Matrix in Tuscany

    This dataset is the origin-destination matrix among the municipalities of Tuscany, extracted from GPS tracks of private vehicles collected from 2014-02-10 to...
    • CSV: 'GPS Origin Destination Matrix' (login required)
  • SoBigData.eu: Dataset

    Twitter fake followers

    Fake followers are fake accounts created in bulk to follow a target account and that can be bought from online markets. In other words, their goal is to increase the...
  • SoBigData.eu: Dataset

    Emergency Tweets 2013 Milan blackout

    This dataset is related to a power outage (i.e., a blackout) that occurred in the city of Milan, in northern Italy, on the night between 14 and 15 May 2013. Despite not...
    • CSV: 'PWO-MIL_tweets.csv' (login required)
  • SoBigData.eu: Dataset

    Emergency Tweets 2011 Christchurch earthquake

    This dataset contains tweets related to the devastating earthquake that occurred on 22 February 2011, at around 12 p.m. local time, in Christchurch, New Zealand...
    • CSV: 'EAQ-CHR_tweets.csv' (login required)
  • SoBigData.eu: Dataset

    Soccer Events

    This dataset contains data regarding one full season of soccer games. For each player there are the locations (positions on the pitch) visited and all the events they generated...
    • ZIP: 'Soccer event data' (login required)
  • SoBigData.eu: Dataset

    Geo-annotated tweets ENG-ITA

    • ZIP: 'geo-annotated tweets.zip' (login required)
  • SoBigData.eu: Dataset

    Twitter social bots

    Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
  • SoBigData.eu: Dataset

    Emergency Tweets 2014 Genoa flood

    This dataset contains Italian tweets collected during and in the aftermath of the floods that occurred near the city of Genoa between 9 and 11 October 2014...
    • ZIP: 'FLO-GEN.zip' (login required)
  • SoBigData.eu: Dataset

    ISTAT Census zone Tuscany

    This dataset contains the geometry of about 20,000 census sectors and limited demographic information for the Tuscany region (Italy).
    • ZIP: 'Istat Dataset' (login required)
  • SoBigData.eu: Dataset

    Congress Network

    Network built on top of US Congress voting data made available on the website GovTrack.us. Nodes identify congressmen and edges represent the semantic "have supported the...
    • 'Original data' (login required)
  • SoBigData.eu: Dataset

    e-MID dataset

    e-MID is the Italian Electronic Market for Interbank Deposits. Data consist of transactions between banks participating in the market. For each transaction, the following...
  • SoBigData.eu: Dataset

    Amazon reviews

    This dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014. This dataset includes reviews (ratings, text,...
    • HTML: "Julian McAuley's repository" (login required)
  • SoBigData.eu: Dataset

    Estonian public sector electronic services and service providers and consumers

    The dataset contains records of electronic services (aka X-Road services), service providers and consumers harvested in April 2014 from RIHA (https://riha.eesti.ee). The data...
  • SoBigData.eu: Dataset

    Articles and comments of major Estonian newspapers

    The dataset contains articles and comments from four major Estonian news portals, from the early 2000s to 2016.
  • SoBigData.eu: Dataset

    Amazon Network

    The network was collected by crawling the Amazon website. It is based on the "Customers Who Bought This Item Also Bought" feature of the Amazon website. If a product i is frequently...
    • HTML: 'Amazon Network' (login required)
  • SoBigData.eu: Dataset

    Twitter Dataset 2013-2014

    The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st...
  • SoBigData.eu: Dataset

    UK election abuse data

    The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...
    • XLS: 'uk-election-abuse.tar.gz' (login required)
  • SoBigData.eu: Dataset

    GPS Tracks - Tuscany 2011

    This dataset contains GPS trajectories of private vehicles crossing the region of Tuscany in Italy. It is composed of about 11 million trips by 150,000 users collected in May...
You can also access this registry using the API (see API Docs).