-
Information Retrieval Module
Study, design and analysis of IR systems which are efficient and effective to process, mine, search, cluster and classify bigdata document collections, coming from textual as...-
PDF
The resource: 'Introduction' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Parsing' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Crawling' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Query Processing' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Index Construction: Sorting' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Random Walks, Ranking and ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Random Walks, Ranking and ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Topic Annotation: Concepts ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Document Compression and ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Document Compression and ...' is not accessible as guest user. You must login to access it!
-
PDF
-
Private Protein-Ligand Interaction Graphs for Affinity Studies
The dataset contains a clean version of the data retrieved from PDBBind in the work of Volkov et al. (2022) that can be used for machine learning-based studies for compound... -
Gene Disease Association Data and Features
This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...-
RAR
The resource: 'Gene_Disease_Association_Da ...' is not accessible as guest user. You must login to access it!
-
RAR
-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...-
ZIP
The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
-
ZIP
-
Compounds with Activity against the Dopamine D2 Receptor
Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...-
ZIP
The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
-
ZIP
-
CLiQS
CLiQS is a Python language software package for social media texts summarization with a diversified approach. -
Cross-Lingual Dataset of Crisis-Related Social Media
If you use this dataset, please cite the following paper: Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive...