43 items found

Tags: Information Retrieval

Filter Results
  • Access required...

    ×

    Method

    Private Cybersecurity NER BERT-base-cased model

    This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that...
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...
  • Access required...

    ×

    Dataset

    Private Protein-Ligand Interaction Graphs for Affinity Studies

    The dataset contains a clean version of the data retrieved from PDBBind in the work of Volkov et al. (2022) that can be used for machine learning-based studies for compound...
  • Dataset

    Gene Disease Association Data and Features

    This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...
    • RAR
      The resource: 'Gene_Disease_Association_Da ...' is not accessible as guest user. You must login to access it!
  • Dataset

    UWB RADAR dataset of human activity detection in smart office

    The UWB RADAR dataset consists of time series data acquired from UWB RADAR deployed in a smart office room located in ICAR-CNR, for monitoring human activity detection. Raw...
    • RAR
      The resource: 'IoT_UWB_RADAR_dataset_for_s ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental outdoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in outdoor of a smart domestic room located in...
    • The resource: 'IoT_dataset_outdoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental indoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in indoor of a smart domestic room located in...
    • RAR
      The resource: 'IoT_dataset_indoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental office room conditions

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...
    • RAR
      The resource: 'IoT_dataset_smart_office' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental conditions in smart office

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in a smart office located in the ICAR CNR IoT...
    • RAR
      The resource: 'Laboratorio IoT' is not accessible as guest user. You must login to access it!
  • Dataset

    User preference-interest dataset

    The User preference-interest dataset is a comprehensive collection of preferences generated by a sequence of 6 regimes following the rules below: - initially, we have...
    • The resource: 'User preference-interest ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Weather and Pollution in Smart Cities

    A set of weather and climatic conditions gathered during the Toolsmart PoN project ( Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
    • ZIP
      The resource: 'Weather and Pollution in ...' is not accessible as guest user. You must login to access it!
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Smart Cities Weather and Pollution conditions

    A set of weather and climatic conditions gathered during the Toolsmart PoN project ( Open Community PA 2020 – Pon Governance 2014-2020). Data are obtained from IoT based...
  • Dataset

    Compounds with Activity against the Dopamine D2 Receptor

    Database containing compounds active against the dopamine D2 receptor together with random inactive compounds as negative samples for learning purposes. Train, validation, and...
    • ZIP
      The resource: 'compound_activity_dopamine_d2' is not accessible as guest user. You must login to access it!
  • Dataset

    FANCY Dataset

    (NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,...
    • The resource: 'FANCY Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225.501 tweets written by 141.277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
      The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Italian Thesaurus for Tourism domain

    An Italian thesaurus in the domain of the Tourism, counting 2,684 concepts, organized according to semantic relationships (equivalence, hierarchical and associative). The...
  • Dataset

    Italian Tourism Dataset

    A set of users' comments crawled and scraped from two main touristic websites (Booking.com and Tripadvisor.com) related to main touristic point of interests in Italy and, in...
    • HTML
      The resource: 'tourism-dataset' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).