38 items found

Licenses: Academic Free License 3.0 Formats: ZIP

Filter Results
  • Access required...

    ×

    Dataset

    Private AMELIA - Argument Mining Evaluation on Legal documents in ItAlian

    This repository contains 225 Italian decisions on Value Added Tax (VAT), annotated in xml, to identify and categorize argumentative text. Based on this data, we propose three...
  • Dataset

    Synthetic Dataset for Photovoltaic Plants

    This synthetic dataset was generated using Gaussian Copula Synthesizer, based on real data from three different photovoltaic plants. The dataset is structured to preserve the...
    • ZIP
      The resource: 'Synthetic_PhotovoltaicSystems' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private neuroAnDetect

    Semi-supervised approach to anomaly detection for neuro-imaging. The method generates segmentation of abnormal tissues that can be used to support medical reporting. The...
  • Dataset

    Telegram data qanonEN chats

    This dataset consists of English-language chats involved in conspiracy discussions on Telegram. The data was collected using a snowball crawling technique that leverages...
    • TXT
      The resource: 'readme' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'telegram_data_qanonEN_chats ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Telegram data cryptoEN chats

    This dataset contains English-language Telegram data focused on discussions related to conspiracy theories and involved in discussions around financial and cryptocurrency...
    • TXT
      The resource: 'readme' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'telegram_data_cryptoEN_chat ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Telegram data conspiracyIT chats

    This dataset contains Italian-language Telegram chats focused on conspiracy discussions. It was collected using a snowball sampling technique based on message forwarding,...
    • TXT
      The resource: 'readme' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'telegram_data_conspiracyIT_ ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Compressed and Learned Data Structures Seminar

    In this seminar cycle, students are guided in the direct usage of a powerful C++ library implementing many state-of-the-art compressed data structures for big data. Other than...
    • PDF
      The resource: 'A gentle introduction to ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Learned indexes, the ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'GitHub Repository' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'GitHub Repository Instructions' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Deep Learning Course

    This course developed by Universitat Politècnica de Catalunya and Barcelona Supercomputing Center provides an applied approach to Deep Learning. It chooses to present an...
    • DOCX
      The resource: 'Instructions' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Deep_Learning_Course' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'High_Performance_Computing_ ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Lesson_a_FeedForward_Neural ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Lesson_b_Recurrent_Neural_N ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Lesson_c_Embedding_Spaces' is not accessible as guest user. You must login to access it!
  • Dataset

    Air Quality Datasets over L'Aquila Region

    These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.
    • CSV
      The resource: 'CeTEMPS Dataset up to 2023' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'ARTA AirQuality up to 2023' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'ESA Sentinel 5P NO2 daily ...' is not accessible as guest user. You must login to access it!
    • HTML
      The resource: 'Map of the area pollutants ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Wi-Fi Dataset of wireless channel samplings

    The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...
    • ZIP
      The resource: 'SoBigData_Wi-Fi_Dataset' is not accessible as guest user. You must login to access it!
  • Experiment

    Annotazione semantica di delibere comunali

    Progetto POC per l'uso delle tecniche di text mining su documenti della pubblica amministrazione per migliorare la trasparenza e l’accesso alle informazioni da parte dei...
    • PDF
      The resource: 'Annotazione Delibere' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Codice sorgente' is not accessible as guest user. You must login to access it!
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Cybersecurity NER BERT-base-cased model

    This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that...
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Application

    Private neXSim

    neXSim is a web-based prototype system implementing "implementing a logic based framework for characterising nexus of similarity within knowledge bases", namely expressing in...
  • Dataset

    Spotify track dataset (small)

    The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...
    • ZIP
      The resource: 'std_small' is not accessible as guest user. You must login to access it!
  • Dataset

    GiveMeSomeCreditSC

    The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...
    • ZIP
      The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
  • Dataset

    Santorini Tweets July-August 2021

    This dataset contains 225.501 tweets written by 141.277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
    • ZIP
      The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).