DE webarchive
The dataset consists of all content from the .de top-level domain as crawled by the Internet Archive.
HTML
Brexit Twitter User Vote Intent
A list of users for whom vote intent in the UK EU membership referendum has been established.
UK General Election Vote Intent
A list of Twitter users for whom party political allegiance/vote intent has been established.
Social Network dataset - LiveJournal
LiveJournal is a free online blogging community where users declare friendship with each other. LiveJournal also allows users to form groups which other members can then join. We...
HTML
Broad Twitter Corpus
The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
JSON
UK election abuse data
The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...
XLS
Twitter Dataset 2013-2014
The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st...
Articles and comments of major Estonian newspapers
The dataset contains articles and comments from four major Estonian news portals, from the early 2000s to 2016.
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
Sheffield NERD Tweet Corpus
The dataset contains 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 come...
FINF
Covid infodemic in Italy -- Most retweeted accounts
Top 10 most retweeted accounts on Covid-related keywords, between Jan 30 and Mar 20, 2020.
ZIP
Twitter Dataset British MPs
This dataset contains the tweet_ids from the Twitter timelines of 584 members of the British Parliament (collected between the 4th and 6th of March 2022). The users are identified from...
TSV
A dataset of journalists on Twitter
This dataset comprises the Twitter timelines of journalists from 17 countries across 8 continental regions, downloaded in May 2018. We used the Twitter...
HTML
Visualising community detection in networks (private)
A series of experimental techniques to visualize relational communities and their blurred boundaries in networks.
Workshopping on social big data and adversarial publics
This report is based on an empirical case study on content moderation, whose objective is to interrogate whether content moderation can be reclaimed by users as a practice...
PDF
Polarizing opinion vectors with the Friedkin-Johnsen Model
This code contains two Mathematica notebooks to find the polarizing opinion vectors given the social graph and the nodes’ susceptibility. The notebooks have to be saved in the...
HTML
Ephemerality metric
https://github.com/HPAI-BSC/ephemerality Code for calculating the ephemerality metrics that can be used to estimate how "ephemeral" discussion topics are based on their...
ZIP
Distance Calculator
The program calculates semantic distances between input texts. As a command-line script, it takes a list of tab-separated text pairs (one pair per line) and returns...
ZIP