3 items found

Licenses: Creative Commons Attribution 4.0 Tags: Text mining Information Retrieval

Filter Results
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
  • Method

    CLiQS

    CLiQS is a Python language software package for social media texts summarization with a diversified approach.
    • The resource: 'CLiQS-CM' is not accessible as guest user. You must login to access it!
  • Dataset

    Cross-Lingual Dataset of Crisis-Related Social Media

    If you use this dataset, please cite the following paper: Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive...
    • The resource: 'Cross-Lingual Dataset of ...' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).