-
CDR data - Tuscany
The dataset contains mobile phone records collected in Tuscany between September 2015 and August 2016. It contains Call Data Records (CDRs) of phone users, and the corresponding... -
GPS Tracks - Calabria, Italy 2012
The dataset consists of GPS tracks of private vehicles collected in the Calabria region (Italy). It contains about 28 million trajectories from about 115,000 users. Data are in the... -
Flickr and Wikipedia Tourism Trajectories
The dataset contains a knowledge base built with data coming from Flickr and Wikipedia. It covers three Italian cities which are important from a sightseeing point of view and...-
ClueWeb09
The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on... -
Twitter fake followers
Fake followers are fake accounts created in bulk to follow a target account, and they can be bought from online markets. In other words, their goal is that of increasing the... -
Twitter social bots
Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,... -
Wyscout soccer-logs dataset
A dataset of soccer-logs for all the main soccer leagues in the world, from the 2014/2015 season to the current one. -
Global Peace Index data
A dataset of the Global Peace Index (GPI), which ranks 163 independent states and territories according to their level of peacefulness. The GPI covers 99.7 per cent of the... -
Food consumption data at the canteens of University of Pisa
A dataset storing all the meals consumed by students at the canteens of the University of Pisa over a six-year period. -
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information... -
Injury forecaster for soccer players
An algorithm to forecast the injuries of soccer players given their training workload. -
Soccer teams ranking simulator
This algorithm simulates the outcome of an entire season for each team in a football league, relying only on technical data (i.e., excluding the goals scored), by exploiting a... -
Digital DNA fingerprinting
"Digital DNA fingerprinting" is a spambot detection technique based on the "Digital DNA" online behavioral modeling technique. Given a set of Twitter user timelines, it is... -
Human Mobility Data Privacy Risk Estimator
This method is a fast and flexible approach to estimate privacy risk in human mobility data. The idea is to train classifiers to capture the relation between individual... -
Diary-based Trajectory Generator
Ditras (DIary-based TRAjectory Simulator) is a framework to simulate the spatio-temporal patterns of human mobility. It operates in two steps: the generation of a mobility... -
GSP - Geo-Semantic-Parsing
GSP receives a text document as input and returns an enriched document, where all mentions of places/locations are associated with the corresponding geographic coordinates. To... -
MSN Search query log
The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)... -
CoPhIR
The CoPhIR (Content-based Photo Image Retrieval) Test-Collection has been developed to make significant tests on the scalability of the SAPIR project infrastructure (SAPIR:...