57 items found

Availability: On-Line Groups: Societal Debates and Misinformation Tags: Text mining

Filter Results
  • Dataset

    Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...

    Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...
    • CSV
      The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Polish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Polish news articles extracted from the dataset described...
    • jsonl
      The resource: 'polish_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Danish sample)

    The Semantic Networks from news articles (Danish sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Danish_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Lithuanian news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Lithuanian news articles extracted from the dataset...
    • jsonl
      The resource: 'lithuanian_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Swedish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Swedish news articles extracted from the dataset described...
    • jsonl
      The resource: 'swedish_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Croatian news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Croatian news articles extracted from the dataset...
    • jsonl
      The resource: 'croatian_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Lexical networks from Finnish news articles

    The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Finnish news articles extracted from the dataset...
    • jsonl
      The resource: 'finnish_egoNet_w4' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Romanian sample)

    The Semantic Networks from news articles (Romanian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Romanian_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Dutch sample)

    The Semantic Networks from news articles (Dutch sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Dutch_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (German sample)

    The Semantic Networks from news articles (German sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'German_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Portuguese sample)

    The Semantic Networks from news articles (Portuguese sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Portuguese_sampleNet_anonym ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Spanish sample)

    The Semantic Networks from news articles (Spanish sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Spanish_sampleNet_anonymized' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (French sample)

    The Semantic Networks from news articles (French sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (English sample)

    The Semantic Networks from news articles (English sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Semantic Networks from news articles (Italian sample)

    The Semantic Networks from news articles (Italian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
    • CSV
      The resource: 'Semantic Networks from ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Broad Twitter Corpus

    The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
    • JSON
      The resource: 'Broad Twitter Corpus' is not accessible as guest user. You must login to access it!
  • Dataset

    Sheffield NERD Tweet Corpus

    The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...
    • FINF
      The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
  • Method

    Distance Calculator

    The program is intended for calculating semantic distances between input texts. As a commandline script it takes a list of tab-separated text pairs (line-per-pair) and returns...
    • ZIP
      The resource: 'Code' is not accessible as guest user. You must login to access it!
  • Method

    Annie Plus Measurements

    This method allows the annotation of named entities (person, location, organization, date) as well as the numbers and measurement expressions. Default Annotations are: Address,...
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
  • Method

    Cymrie Welsh Named Entity Recognizer

    The CYMRIE named entity recognition is a service for the analysis of Welsh text. It identifies name of persons, locations, organizations, as well as money amounts, time and...
    • method-engine
      The resource: 'Run method' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).