-
Brexit Tweets Linked Domains
In this spreadsheet we share domains linked in the UK EU membership referendum tweet collection. Counts for links by leave voters and remain voters are given, enabling sites...-
ODS
The resource: 'Brexit Tweets Linked ...' is not accessible as guest user. You must login to access it!
-
ODS
-
DE webarchive
The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.-
HTML
The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
-
HTML
-
Brexit Twitter User Vote Intent
A list of users for which vote intent in the UK EU membership referendum has been established. -
UK General Election Vote Intent
A list of Twitter users for whom party political allegiance/vote intent has been established. -
Social Network dataset - LiveJournal
LiveJournal is a free on-line blogging community where users declare friendship each other. LiveJournal also allows users form a group which other members can then join. We...-
HTML
The resource: 'LiveJournal social network ...' is not accessible as guest user. You must login to access it!
-
HTML
-
Broad Twitter Corpus
The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...-
JSON
The resource: 'Broad Twitter Corpus' is not accessible as guest user. You must login to access it!
-
JSON
-
UK election abuse data
The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...-
XLS
The resource: 'uk-election-abuse.tar.gz' is not accessible as guest user. You must login to access it!
-
XLS
-
Twitter Dataset 2013-2014
The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st... -
Articles and comments of major Estonian newspapers
The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016. -
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information... -
Sheffield NERD Tweet Corpus
The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...-
FINF
The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
-
FINF
-
GPS Tracks - Tuscany 2011
This dataset contains GPS trajectories of private vehicles crossing the region of Tuscany in Italy. It is composed of about 11 mln of trips of 150.000 users collected in May... -
GeoLife - GPS trajectories dataset
This (link to a) GPS trajectory dataset was collected in (Microsoft Research Asia) Geolife project by 182 users in a period of over three years (from April 2007 to August 2012)....-
ZIP
The resource: 'GeoLife Download page' is not accessible as guest user. You must login to access it!
-
ZIP
-
Aalto-Twitter
The dataset consists of about 418 million of tweets from June 25, 2015 to September 19, 2015. Tweets are about trending hashtags gathered though the public Twitter api. -
Aalto-Foursquare
The dataset consists of about 15 million of tweets which point to public Foursquare check-ins. -
Open data from NervousNet
This dataset contains anonymized proximity information sent by 154 mobile phones (both Android and iPhone) via phone apps. These information are sent by bluetooth beacons every...-
ZIP
The resource: 'open data from NervousNet' is not accessible as guest user. You must login to access it!
-
ZIP
-
Micro Project Datasets: Academic Migration and Academic Networks
Datasets used and produced for and from the micro project titled: Academic Migration and Academic Networks: Evidence from Scholarly Big Data and the Iron Curtain-
HTML
The resource: 'Micro Project Datasets' is not accessible as guest user. You must login to access it!
-
HTML
-
Activity data from the Covid19 period
Activity data from Telia telecommunications company, Finland reports the number of people dwelling in area for a certain amount of time. More precisely, activity count...