5 items found

Licenses: Creative Commons Attribution 4.0 Tags: Web mining

Filter Results
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Misinformation Detection ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'DEAP-FAKED: Knowledge ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Research Article' is not accessible as guest user. You must login to access it!