43 items found

Tags: Information Retrieval

Filter Results
  • TrainingMaterial

    Information Retrieval Module

    Study, design and analysis of IR systems which are efficient and effective to process, mine, search, cluster and classify bigdata document collections, coming from textual as...
    • PDF
      The resource: 'Introduction' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Parsing' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Crawling' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Query Processing' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Index Construction: Sorting' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Random Walks, Ranking and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Random Walks, Ranking and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Topic Annotation: Concepts ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Document Compression and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Document Compression and ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Jupyter Notebooks

    King’s College London has developed complete stories around Jupyter Notebooks that form easy recipes for reproducible methods in social data science. Jupyter...
    • ZIP
      The resource: 'Historical Cultures Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Prediction Modelling ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Social and Cultural ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Social Sensing Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Visual Arts Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Ananke Guide' is not accessible as guest user. You must login to access it!
    • mp4
      The resource: 'Ananke Guide Video' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Database Module

    The 'Database Module' aims to introduce database analysis, focusing on DBMS architecture, Relational Models, SQL language and SQL nested queries. It is part of the Master in...
    • PDF
      The resource: 'Introduction to Database ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Relational Model Module' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Database SQL Module' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Inner Queries and Views Module' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Interactive Learning Environments

    King’s College London developed a variety of data science materials based on R and Python. R is a de facto standard in statistical computing and visualisation, while our...
    • ZIP
      The resource: 'Rstudio docker image' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'VirtualBox' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Swirl courses' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Tutorial on Learning To Rank

    Efficiency/Effectiveness Trade-offs in Learning to Rank” tutorial by Claudio Lucchese and Franco Maria Nardini at the European Conference on Machine Learning and Principles...
    • DOCX
      The resource: 'Instructions' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Slides-Part1' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Slides-Part2' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Slides-Part3' is not accessible as guest user. You must login to access it!
    • ipynb
      The resource: 'HandsOn-1' is not accessible as guest user. You must login to access it!
    • ipynb
      The resource: 'HandsOn-2' is not accessible as guest user. You must login to access it!
    • tar.gz
      The resource: 'Hands-On 1/2, QuickRank ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Cybersecurity NER BERT-base-cased model

    This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that...
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...
  • Access required...

    ×

    Dataset

    Private Protein-Ligand Interaction Graphs for Affinity Studies

    The dataset contains a clean version of the data retrieved from PDBBind in the work of Volkov et al. (2022) that can be used for machine learning-based studies for compound...
  • Dataset

    Gene Disease Association Data and Features

    This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...
    • RAR
      The resource: 'Gene_Disease_Association_Da ...' is not accessible as guest user. You must login to access it!
  • Dataset

    UWB RADAR dataset of human activity detection in smart office

    The UWB RADAR dataset consists of time series data acquired from UWB RADAR deployed in a smart office room located in ICAR-CNR, for monitoring human activity detection. Raw...
    • RAR
      The resource: 'IoT_UWB_RADAR_dataset_for_s ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental outdoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in outdoor of a smart domestic room located in...
    • The resource: 'IoT_dataset_outdoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental indoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in indoor of a smart domestic room located in...
    • RAR
      The resource: 'IoT_dataset_indoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental office room conditions

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...
    • RAR
      The resource: 'IoT_dataset_smart_office' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental conditions in smart office

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in a smart office located in the ICAR CNR IoT...
    • RAR
      The resource: 'Laboratorio IoT' is not accessible as guest user. You must login to access it!
  • Dataset

    User preference-interest dataset

    The User preference-interest dataset is a comprehensive collection of preferences generated by a sequence of 6 regimes following the rules below: - initially, we have...
    • The resource: 'User preference-interest ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Weather and Pollution in Smart Cities

    A set of weather and climatic conditions gathered during the Toolsmart PoN project ( Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
    • ZIP
      The resource: 'Weather and Pollution in ...' is not accessible as guest user. You must login to access it!
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Smart Cities Weather and Pollution conditions

    A set of weather and climatic conditions gathered during the Toolsmart PoN project ( Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
You can also access this registry using the API (see API Docs).