129 items found

Tags: Text mining

Filter Results
  • Method

    Ariadne Swedish Dendrochronology Entity Recognizer

    Identifies terms and phrases in Swedish for analysing archaeological text. The method delivers named entities of archaeological elements, wood material, sample, and date, with...
    • method-engine
      The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
  • Method

    GATE Cloud Chemical Entity Recogniser

    This service annotates chemical named entities using the open source OSCAR4 tagger. As well as the names of the detected entities the tagger also returns their structure in...
    • method-engine
      The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
  • Dataset

    Wikinews dataset

    This dataset consists of a sample of 365 news published by Wikinews from November 2004 to June 2014 and annotated with about 5000 entities, each associated with a saliency...
    • JSON
      The resource: 'entity-saliency' is not accessible as guest user. You must login to access it!
  • Dataset

    The Italian Music Dataset

    The dataset is built by exploiting the Spotify and SoundCloud APIs. It is composed of over 14,500 different songs of both famous and less famous Italian musicians. Each song...
    • JSON
      The resource: 'Dataset' is not accessible as guest user. You must login to access it!
  • Method

    ArchiveSpark

    ArchiveSpark is an Apache Spark framework for easy data access, processing, extraction as well as derivation for Web archives and archival collections. It has a simple and...
    • The resource: 'ArchiveSpark on GitHub' is not accessible as guest user. You must login to access it!
  • Method

    Dictionary creator

    This tool creates a dictionary with inverse document frequency (idf) values from the Google NGrams dataset.
    • The resource: 'Source code' is not accessible as guest user. You must login to access it!
  • Dataset

    WIRE dataset

    This dataset consists of 503 pairs of Wikipedia entities drawn from the New York Times dataset with a human assigned relatedness score. The domain experts based their...
    • HTML
      The resource: 'WikipediaRelatedness' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'WIRE dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Wikipedia Word Embeddings

    Embeddings were created through applying word2vec skipgram to a corpus of wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0...
    • The resource: 'Embeddings' is not accessible as guest user. You must login to access it!
  • Dataset

    Amazon reviews

    This (link to the) dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014. This dataset includes reviews...
    • HTML
      The resource: 'Julian McAuley's repository.' is not accessible as guest user. You must login to access it!
  • Dataset

    Conversational search dataset with labels

    CAsT 2019 data is split into two files one for training and the other one for testing. - Training set: CAsT 2019 conversations from training set and from test set without...
    • The resource: 'Conversational dataset ...' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to the library' is not accessible as guest user. You must login to access it!
  • Dataset

    Learning to quantify: LeQua 2022 datasets

    The aim of LeQua 2022 (the 1st edition of the CLEF “Learning to Quantify” lab) is to allow the comparative evaluation of methods for “learning to quantify” in textual...
    • The resource: 'Zenodo link' is not accessible as guest user. You must login to access it!
  • Dataset

    Product Reviews for Ordinal Quantification

    This data set comprises a labeled training set, validation samples, and testing samples for ordinal quantification. It appears in our research paper "Ordinal Quantification...
    • The resource: 'Zenodo link' is not accessible as guest user. You must login to access it!
  • Dataset

    Cherenkov Telescope Data for Ordinal Quantification

    This labeled data set is targeted at ordinal quantification. It appears in our research paper "Ordinal Quantification Through Regularization", which we have published at...
    • The resource: 'Zenodo' is not accessible as guest user. You must login to access it!
  • Dataset

    Cross-Lingual Dataset of Crisis-Related Social Media

    If you use this dataset, please cite the following paper: Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive...
    • The resource: 'Cross-Lingual Dataset of ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Dataset for Evaluating Abstractive Summaries of Crisis-Related Social Media

    The dataset created for evaluation of summaries generated from social media posted during five natural disasters. The dataset contains: ground truth reports created by human...
    • The resource: 'Dataset for Evaluating ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Ecology of the digital world of Wikipedia

    Wikipedia, a paradigmatic example of online knowledge space is organized in a collaborative, bottom-up way with voluntary contributions, yet it maintains a level of reliability...
  • TrainingMaterial

    GATE Course

    The material is the 2017 version of a week-long training course delivered annually by the GATE team. Over almost ten years, this course has been developed to provide basic and...
    • PDF
      The resource: 'Module 1 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 1 - Hands-on materials' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 1 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 1 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 1 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 1 - Advanced JAPE' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 1 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Crowdsourcing ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - GATE Mímir and ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Classification ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Classification ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - GATE ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Chunking - ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Hands-on materials for ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 4 - Advanced GATE ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 4 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 4 - Opinion Mining' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 4 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - The GATE ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 5 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - Creating new ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 5 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - Advanced GATE ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - Advanced GATE ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 5 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Applications - ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Applications - ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Applications - ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 6 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Entity Linking' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - JAPE Practical ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 6 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Summarisation ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Data Mining and Machine Learning Module

    The module provides an introduction to base concepts of data mining and knowledge extraction process, introducing analytical models and algorithms for clustering,...
    • PDF
      The resource: 'Introduction' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Case Studies Outline' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Data Preparation and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Clustering' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Classification' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Machine Learning and Data ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Fraud Detection' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Exemplar Projects on ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Data Mining and Machine Learning for Social Science

    An introductory course for data mining and machine learning for social science. The course focuses on presenting typical data mining and machine learning techniques by using a...
    • PDF
      The resource: 'Data Manipulation' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Data Manipulation with AWK' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Data Manipulation with MySQL' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Data Visualisation and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Graphical Analysis with ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Introduction to Machine ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Classification and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'K-Nearest Neighbour Classifier' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Unsupervised Data Mining - ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Unsupervised Density-based ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Mining the Social Web - ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Tracking Language Mobility ...' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).