9 items found

Tags: Web mining

  • SoBigData.eu: TrainingMaterial

    Introduction to Data Curation

    This course is an introduction to data collection, data preparation and transformation, and data analysis. It contains the essential concepts for a researcher in order to...
    • PDF (login required)
  • SoBigData.eu: Experiment

    Forecasting the market value of soccer players from soccer-logs and social me...

    This experiment aims to develop a methodology to monitor and predict the market value of professional soccer players given their performance computed from soccer-logs and...
  • SoBigData.eu: TrainingMaterial

    Private Archive Crawling

    Web archives are typically very broad in scope and extremely large in scale. This makes data analysis appear daunting, especially for non-computer scientists. These...
  • SoBigData.eu: TrainingMaterial

    Private High Performance and Scalable Analytics Module

    Mining with big data or big data mining has become an active research area. Running current analytical methodologies and software tools on a single personal computer cannot...
  • SoBigData.eu: TrainingMaterial

    Private Archive Spark

    An Apache Spark framework for easy data processing, extraction, and derivation for archival collections. Originally developed for use with Web archives, it has now...
  • SoBigData.eu: TrainingMaterial

    Private Data Mining and Machine Learning Module

    The module provides an introduction to the basic concepts of data mining and the knowledge extraction process, introducing analytical models and algorithms for clustering,...
  • SoBigData.eu: TrainingMaterial

    Private GATE Course

    The material is the 2017 version of a week-long training course delivered annually by the GATE team. Over almost ten years, this course has been developed to provide basic and...
  • SoBigData.eu: Dataset

    Wikipedia Word Embeddings

    Embeddings were created by applying word2vec skip-gram to a corpus of Wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0...
    • Embeddings (login required)
  • SoBigData.eu: Method

    Dictionary creator

    This tool creates a dictionary with inverse document frequency (idf) values from the Google NGrams dataset.
    • Source code (login required)
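
    As a rough illustration of what an idf dictionary like the one described for the Dictionary creator might contain, the sketch below maps each term to ln(N / df), where df is the number of documents containing the term and N is the total number of documents. The formula variant, the function name `build_idf_dictionary`, and the toy counts are assumptions made for this example; the tool's actual source code and its handling of the Google NGrams files are not shown in this listing.

```python
import math


def build_idf_dictionary(doc_freq, total_docs):
    """Return {term: ln(total_docs / df)} for every term in doc_freq.

    doc_freq   -- hypothetical mapping of term -> number of documents containing it
    total_docs -- hypothetical total number of documents in the collection
    """
    idf = {}
    for term, df in doc_freq.items():
        # A term cannot occur in fewer than 1 or more than total_docs documents.
        if df <= 0 or df > total_docs:
            raise ValueError(f"invalid document frequency for {term!r}: {df}")
        idf[term] = math.log(total_docs / df)
    return idf


# Toy counts, invented for illustration only (not taken from Google NGrams).
toy_counts = {"the": 1000, "mining": 100, "sobigdata": 1}
idf = build_idf_dictionary(toy_counts, total_docs=1000)
```

    Under this definition, rarer terms receive larger values, and a term that occurs in every document receives an idf of zero.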
You can also access this registry using the API (see API Docs).