-
Emergency Tweets 2013 Sardinia flood
This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
ZIP: FLO-SAR.zip
-
Emergency Tweets 2009 L'Aquila earthquake
This dataset comprises 1,100 Italian tweets shared in the aftermath of the 2009 L’Aquila earthquake (https://en.wikipedia.org/wiki/2009_L%27Aquila_earthquake). The earthquake...
ZIP: EAQ-LAQ.zip
-
Emergency Tweets 2013 Milan blackout
This dataset is related to a power outage (i.e., a blackout) that occurred in the city of Milan, in northern Italy, on the night between 14 and 15 May 2013. Despite not...
CSV: PWO-MIL_tweets.csv
-
Emergency Tweets 2011 Christchurch earthquake
This dataset contains tweets related to the devastating earthquake that occurred on 22 February 2011, at around 12 p.m. local time, in Christchurch, New Zealand...
CSV: EAQ-CHR_tweets.csv
-
-
ZIP: geo-annotated tweets.zip
-
Emergency Tweets 2014 Genoa flood
This dataset contains Italian tweets collected during and in the aftermath of the floods that occurred near the city of Genoa between 9 and 11 October 2014...
ZIP: FLO-GEN.zip
-
Emergency Tweets 2012 Emilia earthquake
This dataset contains 3,170 Italian tweets about the earthquakes that struck the Emilia Romagna regional district in Italy on 20 May 2012 starting from 4 a.m. local time...
ZIP: EAQ-EML.zip
-
Twitter dataset about two premier UK music festivals
The dataset contains Twitter posts about two premier UK music festivals: Creamfields 2016 (on August 25th-28th) and VFestival 2016 (on August 20th-21st).
GitHub: Twitter dataset about two ...
-
Twitter fake followers
Fake followers are fake accounts created in bulk to follow a target account; they can be bought from online markets. In other words, their goal is that of increasing the...
-
Twitter social bots
Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
-
HTML: Link to the library
-
Python library for direct and indirect discrimination prevention in data mining
This Python library implements the discrimination discovery and prevention method proposed in the paper: “A methodology for direct and indirect discrimination prevention in...
GitHub: Link to library
-
GSP - Geo-Semantic-Parsing
GSP receives a text document as input and returns an enriched document, where all mentions of places/locations are associated with the corresponding geographic coordinates. To...
-
Pluralistic Recommendation in News - Report
Report on the Humane-AI microproject "Pluralistic Recommendation in News". It contains details on how the two relevant datasets were built, a link to the two, and details...
PDF: Humane_ai_Report
-
Semantic annotation of municipal resolutions (Annotazione semantica di delibere comunali)
Proof-of-concept project on the use of text mining techniques on public administration documents to improve transparency and access to information by...
-
Gate Cloud
This is the new improved version of the GATE Cloud platform originally launched in 2011 to provide end-to-end text processing solutions from the GATE family running on cloud...
HTML: Gate Cloud Methods
HTML: Gate Cloud Twitter Collector
HTML: Brexit analyser
-
Private Distributed W2V
Accelerated training of word embeddings for large text corpora. Creates a word2vec model from an input corpus of tokenized texts through the use of parallel distributed...
-
The Italian Music Dataset
The dataset is built by exploiting the Spotify and SoundCloud APIs. It is composed of over 14,500 different songs of both famous and less famous Italian musicians. Each song...
JSON: Dataset
-
Conversational search dataset with labels
CAsT 2019 data is split into two files, one for training and one for testing. - Training set: CAsT 2019 conversations from training set and from test set without...
-
Dataset for Evaluating Abstractive Summaries of Crisis-Related Social Media
The dataset was created to evaluate summaries generated from social media content posted during five natural disasters. The dataset contains: ground truth reports created by human...