275 items found

Filter Results
  • Dataset

    Russell 3000 stock prices

    This dataset contains the price and volume of the 3000 stocks belonging to the Russell 3000 Index, roughly corresponding to the 3000 more capitalized stocks. Traded volume and...
  • Dataset

    Formal network of Estonian companies and board members

    This dataset consists of managed and continuously updated data about Estonian companies and board members since 1994. Technical documentation of data structures and the REST API...
  • Dataset

    Estonian public sector electronic services and service providers and consumers

    The dataset contains records of electronic services (aka X-Road services), service providers and consumers harvested in April 2014 from RIHA (https://riha.eesti.ee). The data...
  • Dataset

    Brexit Tweets Linked Domains

    In this spreadsheet we share domains linked in the UK EU membership referendum tweet collection. Counts for links by leave voters and remain voters are given, enabling sites...
    • ODS
      The resource: 'Brexit Tweets Linked ...' is not accessible as guest user. You must login to access it!
  • Dataset

    DE webarchive

    The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.
    • HTML
      The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Brexit Twitter User Vote Intent

    A list of users for which vote intent in the UK EU membership referendum has been established.
  • Dataset

    UK General Election Vote Intent

    A list of Twitter users for whom party political allegiance/vote intent has been established.
  • Dataset

    Social Network dataset - LiveJournal

    LiveJournal is a free on-line blogging community where users declare friendship each other. LiveJournal also allows users form a group which other members can then join. We...
    • HTML
      The resource: 'LiveJournal social network ...' is not accessible as guest user. You must login to access it!
  • Dataset

    .ee Web archive

    .ee Web archive consisting of snapshots from 2015
  • Dataset

    Broad Twitter Corpus

    The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
    • JSON
      The resource: 'Broad Twitter Corpus' is not accessible as guest user. You must login to access it!
  • Dataset

    UK election abuse data

    The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...
    • XLS
      The resource: 'uk-election-abuse.tar.gz' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter Dataset 2013-2014

    The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st...
  • Dataset

    Articles and comments of major Estonian newspapers

    The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016.
  • Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • Dataset

    Sheffield NERD Tweet Corpus

    The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...
    • FINF
      The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
  • Dataset

    GPS Tracks - Tuscany 2011

    This dataset contains GPS trajectories of private vehicles crossing the region of Tuscany in Italy. It is composed of about 11 mln of trips of 150.000 users collected in May...
  • Dataset

    GeoLife - GPS trajectories dataset

    This (link to a) GPS trajectory dataset was collected in (Microsoft Research Asia) Geolife project by 182 users in a period of over three years (from April 2007 to August 2012)....
    • ZIP
      The resource: 'GeoLife Download page' is not accessible as guest user. You must login to access it!
  • Dataset

    Aalto-Twitter

    The dataset consists of about 418 million of tweets from June 25, 2015 to September 19, 2015. Tweets are about trending hashtags gathered though the public Twitter api.
  • Dataset

    Aalto-Foursquare

    The dataset consists of about 15 million of tweets which point to public Foursquare check-ins.