-
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData. -
Private Post-earthquake Reconstruction Progress Datasets over L'Aquila City
Reconstruction data sets, provided by the National Public Entities of USRA and USRC. These data sets are stored in CSV files and provide comprehensive information related to... -
Introduction to Data Curation
This course is an introduction to data collection, data preparation & transformation and data analysis. It contains the essential concepts for a researcher in order to...-
PDF
The resource: 'Introduction to Data Curation' is not accessible as guest user. You must login to access it!
-
PDF
-
ClueWeb09
The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on... -
Global Peace Index data
A dataset of the Global Peace Index (GPI), which ranks 163 independent states and territories according to their level of peacefulness. The GPI covers 99.7 per cent of the... -
NYSE transactions
This dataset contains financial data on the price of the top 250 most liquid assets of New York Stock Exchange (NYSE) from 2006 to 2014. The dataset contains transactions,... -
FED data
March 2001- September 2013 quarterly data of US banks' holdings. The number of financial institutions present in the data is pretty stable during quarters, starting from... -
Retail market dataset
The dataset contains purchases of Unicoop Tirreno customers, description and information of the shops (both small shops and supermarkets) and the customers. -
Retail Market Data
This dataset contains Retail Market Data about food products, from 2007, for about 130 shops of an Italian Distribution chain. Data are of about 1 M of Active Clients, and... -
Russell 3000 stock prices
This dataset contains the price and volume of the 3000 stocks belonging to the Russell 3000 Index, roughly corresponding to the 3000 more capitalized stocks. Traded volume and... -
Articles and comments of major Estonian newspapers
The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016. -
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information... -
German Academic Web
The dataset contains regular crawls of the websites for German academic institutions. -
MSN Search query log
The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)... -
CoPhIR
The CoPhIR (Content-based Photo Image Retrieval) Test-Collection has been developed to make significant tests on the scalability of the SAPIR project infrastructure (SAPIR:... -
Interactive Learning Environments
King’s College London developed a variety of data science materials based on R and Python. R is a de facto standard in statistical computing and visualisation, while our... -
Efficiency - Effectiveness Trade-offs in Learning to Rank
This tutorial provides an 'Introduction to Learning to Rank' and focuses on 'Dealing with the Efficiency/Effectiveness trade-off in Web Search'. Moreover, it provides two...-
PDF
The resource: 'Introduction to Learning ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Dealing with the ...' is not accessible as guest user. You must login to access it!
-
python
The resource: 'Hands-on Session 1 ' is not accessible as guest user. You must login to access it!
-
python
The resource: 'Hands-on Session 2 ' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Publicly available ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Istella Learning to Rank ...' is not accessible as guest user. You must login to access it!
-
PDF
-
Jupyter Notebooks
King’s College London has developed complete stories around Jupyter Notebooks that form easy recipes for reproducible methods in social data science. Jupyter...-
ZIP
The resource: 'Historical Cultures Repository' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Prediction Modelling ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Social and Cultural ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Social Sensing Repository' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Visual Arts Repository' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Ananke Guide' is not accessible as guest user. You must login to access it!
-
mp4
The resource: 'Ananke Guide Video' is not accessible as guest user. You must login to access it!
-
ZIP