-
Private Jupyter Notebooks
King’s College London has developed complete stories around Jupyter Notebooks that form easy recipes for reproducible methods in social data science. Jupyter... -
Private Efficiency - Effectiveness Trade-offs in Learning to Rank
This tutorial provides an 'Introduction to Learning to Rank' and focuses on 'Dealing with the Efficiency/Effectiveness trade-off in Web Search'. Moreover, it provides two... -
Private Archive Crawling
Web archives are typically very broad in scope and extremely large in scale. This makes data analysis appear daunting, especially for non-computer scientists. These... -
Private Text Analytics and Opinion Mining Module
The goals of this module are: - Have a general knowledge of text mining problems and methods. - Recognize situations in which Sentiment Analysis techniques can solve... -
Private Database Module
The 'Database Module' aims to introduce database analysis, focusing on DBMS architecture, Relational Models, SQL language and SQL nested queries. It is part of the Master in... -
Private Interactive Learning Environments
King’s College London developed a variety of data science materials based on R and Python. R is a de facto standard in statistical computing and visualisation, while our... -
Private SOS Online Abuse of Politicians
The material is a 20 minute video presentation describing the GATE team's work investigating online abuse of UK politicians. The video comprises slides and a voice-over. The... -
Private High Performance and Scalable Analytics Module
Mining with big data or big data mining has become an active research area. Running current analytical methodologies and software tools on a single personal computer cannot... -
Private Data Management for Business Intelligence Module
This module provides an introduction to information storage and management performed in order to support business decisions of organizations. It is part of the Master in Big... -
Private Data Visualisation and Visual Analytics Module
This module provides an introduction to the concepts of vision and perception in order to design an effective data visualisation. Moreover, it provides insight into visual... -
Private Data Mining and Machine Learning for Social Science
An introductory course for data mining and machine learning for social science. The course focuses on presenting typical data mining and machine learning techniques by using a... -
Private Visual Analytics for Data Scientists
Participants to this module shall - Learn the principles and rules underlying the design of visual data representations and human-computer interactions - Understand,... -
Private Data Mining and Machine Learning Module
The module provides an introduction to base concepts of data mining and knowledge extraction process, introducing analytical models and algorithms for clustering,... -
Introduction to Data Curation
This course is an introduction to data collection, data preparation & transformation and data analysis. It contains the essential concepts for a researcher in order to...-
PDF
The resource: 'Introduction to Data Curation' is not accessible as guest user. You must login to access it!
-
PDF
-
Privacy Risk on Trajectories
This method provides a Privacy Risk Assessment on mobility data, in terms of trajectories or aggregation of trajectories, i.e., locations with frequency of visit and locations...-
Python
The resource: 'privacy-lib' is not accessible as guest user. You must login to access it!
-
Python
-
Aalto-Foursquare
The dataset consists of about 15 million of tweets which point to public Foursquare check-ins. -
Twitter Dataset 2013-2014
The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st... -
Amazon Network
Network was collected by crawling Amazon website. It is based on Customers Who Bought This Item Also Bought feature of the Amazon website. If a product i is frequently...-
HTML
The resource: 'Amazon Network ' is not accessible as guest user. You must login to access it!
-
HTML
-
Wikinews dataset
This dataset consists of a sample of 365 news published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency...-
JSON
The resource: 'entity-saliency' is not accessible as guest user. You must login to access it!
-
JSON