-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
ZIP
SWH Filenames
-
Introduction to Data Curation
This course is an introduction to data collection, data preparation and transformation, and data analysis. It contains the essential concepts for a researcher in order to...
PDF
Introduction to Data Curation
-
Wikipedia Word Embeddings
Embeddings were created by applying word2vec skip-gram to a corpus of non-stub Wikipedia articles from a December 2015 English dump with the following parameters: -cbow 0...
High Performance and Scalable Analytics Module
Mining with big data, or big data mining, has become an active research area. Running current analytical methodologies and software tools on a single personal computer cannot...
PDF
Introduction to Parallel ...
-
PDF
Introduction to Hadoop
-
PDF
Hadoop Patterns
-
PDF
Remote Connection and HDFS
-
ZIP
Exercises for Remote ...
-
PDF
Introduction to Spark
-
ZIP
Exercises for Introduction ...
-
PDF
Introduction to Spark SQL
-
ZIP
Exercises for Introduction ...
-
PDF
Hadoop Ecosystem and ...
-
PDF
Data Mining with Spark (MLLIB)
-
ZIP
Exercises for Data Mining ...
-
PDF