-
Private AMELIA - Argument Mining Evaluation on Legal documents in ItAlian
This repository contains 225 Italian decisions on Value Added Tax (VAT), annotated in xml, to identify and categorize argumentative text. Based on this data, we propose three... -
Synthetic Dataset for Photovoltaic Plants
This synthetic dataset was generated using Gaussian Copula Synthesizer, based on real data from three different photovoltaic plants. The dataset is structured to preserve the...-
ZIP
The resource: 'Synthetic_PhotovoltaicSystems' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private neuroAnDetect
Semi-supervised approach to anomaly detection for neuro-imaging. The method generates segmentation of abnormal tissues that can be used to support medical reporting. The... -
Telegram data qanonEN chats
This dataset consists of English-language chats involved in conspiracy discussions on Telegram. The data was collected using a snowball crawling technique that leverages... -
Telegram data cryptoEN chats
This dataset contains English-language Telegram data focused on discussions related to conspiracy theories and involved in discussions around financial and cryptocurrency... -
Telegram data conspiracyIT chats
This dataset contains Italian-language Telegram chats focused on conspiracy discussions. It was collected using a snowball sampling technique based on message forwarding,... -
Compressed and Learned Data Structures Seminar
In this seminar cycle, students are guided in the direct usage of a powerful C++ library implementing many state-of-the-art compressed data structures for big data. Other than...-
PDF
The resource: 'A gentle introduction to ...' is not accessible as guest user. You must login to access it!
-
PDF
The resource: 'Learned indexes, the ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'GitHub Repository' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'GitHub Repository Instructions' is not accessible as guest user. You must login to access it!
-
PDF
-
Deep Learning Course
This course developed by Universitat Politècnica de Catalunya and Barcelona Supercomputing Center provides an applied approach to Deep Learning. It chooses to present an...-
DOCX
The resource: 'Instructions' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Deep_Learning_Course' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'High_Performance_Computing_ ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Lesson_a_FeedForward_Neural ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Lesson_b_Recurrent_Neural_N ...' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'Lesson_c_Embedding_Spaces' is not accessible as guest user. You must login to access it!
-
DOCX
-
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.-
CSV
The resource: 'CeTEMPS Dataset up to 2023' is not accessible as guest user. You must login to access it!
-
CSV
The resource: 'ARTA AirQuality up to 2023' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'ESA Sentinel 5P NO2 daily ...' is not accessible as guest user. You must login to access it!
-
HTML
The resource: 'Map of the area pollutants ...' is not accessible as guest user. You must login to access it!
-
CSV
-
-
ZIP
The resource: 'Dataset' is not accessible as guest user. You must login to access it!
-
ZIP
-
Wi-Fi Dataset of wireless channel samplings
The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...-
ZIP
The resource: 'SoBigData_Wi-Fi_Dataset' is not accessible as guest user. You must login to access it!
-
ZIP
-
Annotazione semantica di delibere comunali
Progetto POC per l'uso delle tecniche di text mining su documenti della pubblica amministrazione per migliorare la trasparenza e l’accesso alle informazioni da parte dei... -
y/Politics 1k
Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...-
ZIP
The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Cybersecurity NER BERT-base-cased model
This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that... -
Cybersecurity NER RoBERTa-base model
This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...-
JSON
The resource: 'config' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'merges' is not accessible as guest user. You must login to access it!
-
BIN
The resource: 'model' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'model_args' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'scheduler' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'training_args' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'vocab' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'optimizer' is not accessible as guest user. You must login to access it!
-
py
The resource: 'inference' is not accessible as guest user. You must login to access it!
-
JSON
-
Cybersecurity NER SecureBERT model
This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...-
JSON
The resource: 'config' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'merges' is not accessible as guest user. You must login to access it!
-
BIN
The resource: 'model' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'model_args' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'optimizer' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'scheduler' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'training_args' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'vocab' is not accessible as guest user. You must login to access it!
-
text/x-python
The resource: 'inference' is not accessible as guest user. You must login to access it!
-
JSON
-
Private neXSim
neXSim is a web-based prototype system implementing "implementing a logic based framework for characterising nexus of similarity within knowledge bases", namely expressing in... -
Spotify track dataset (small)
The dataset is created exploiting the Spotify API and the tracks id provided by the authors of https://www.kaggle.com/datasets/maharshipandya/-spotify-tracks-dataset.... The...-
ZIP
The resource: 'std_small' is not accessible as guest user. You must login to access it!
-
ZIP
-
GiveMeSomeCreditSC
The GiveMeSomeCredit dataset - https://www.kaggle.com/c/GiveMeSomeCredit - contains different features of borrowers. The task is predicting the financial distress of a...-
ZIP
The resource: 'GiveMeSomeCreditSC' is not accessible as guest user. You must login to access it!
-
ZIP
-
Santorini Tweets July-August 2021
This dataset contains 225.501 tweets written by 141.277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...-
ZIP
The resource: 'tweet_santorini.csv' is not accessible as guest user. You must login to access it!
-
ZIP