-
SWH Filenames
A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...-
ZIP
The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
-
ZIP
-
Temporal social network reconstruction using wireless proximity sensors: mode...
The emerging technologies of wearable wireless devices open entirely new ways to record various aspects of human social interactions in a broad range of settings. Such...-
HTML
The resource: 'Link to Publication' is not accessible as guest user. You must login to access it!
-
HTML
-
CLiQS
CLiQS is a Python language software package for social media texts summarization with a diversified approach. -
Multi-Task Faces (MTF) dataset
The Multi-Task Faces (MTF) dataset consists of cropped human faces for classification tasks or other research purposes. Each image in the dataset is labelled according to four...-
ZIP
The resource: 'MTF_dataset_20230701' is not accessible as guest user. You must login to access it!
-
ZIP
-
Ariadne Dutch Dendrochronology Entity Recognizer
Identifies terms and phrases in Dutch for analysing archaeological text. The method delivers named entities of archaeological elements, wood material, sample, and date, with...-
method-engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
-
method-engine
-
Ariadne Dutch Archaeology Named Entity Recognizer
Identifies terms and phrases in Dutch for analysing archaeological text. The method delivers named entities of archaeological context, physical object, material, time...-
method-engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
-
method-engine
-
Ariadne English Archaeology Named Entity Recognizer
Identifies terms and phrases in English for analysing archaeological text. The method delivers named entities of archaeological context, physical object, material, time...-
method-engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
-
method-engine
-
Ariadne Swedish Archaeology Named Entity Recognizer
Identifies terms and phrases in Swedish for analysing archaeological text. The method delivers named entities of archaeological context, physical object, material, time...-
method-engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
-
method-engine
-
Ariadne English Dendrochronology Entity Recognizer
Identifies terms and phrases in English for analysing archaeological text. The method delivers named entities of archaeological elements, wood material, sample, and date, with...-
method-engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
-
method-engine
-
Ariadne Swedish Dendrochronology Entity Recognizer
Identifies terms and phrases in Swedish for analysing archaeological text. The method delivers named entities of archaeological elements, wood material, sample, and date, with...-
method-engine
The resource: 'Method Engine' is not accessible as guest user. You must login to access it!
-
method-engine
-
MaxAndSam Network Reconstruction Method
This method reconstructs socio-economic and financial networks from partial information, i.e., the knowledge of intrinsic node-specific properties and of the number of...-
RAR
The resource: 'Reconstruction of ...' is not accessible as guest user. You must login to access it!
-
RAR
-
Learning to quantify: LeQua 2022 datasets
The aim of LeQua 2022 (the 1st edition of the CLEF “Learning to Quantify” lab) is to allow the comparative evaluation of methods for “learning to quantify” in textual... -
Cherenkov Telescope Data for Ordinal Quantification
This labeled data set is targeted at ordinal quantification. It appears in our research paper "Ordinal Quantification Through Regularization", which we have published at... -
VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination ...
We create a publicly available dataset of over 3,100 COVID-19 vaccine-related tweets labeled as one of four stance categories: pro-vaxx, anti-vaxx, vaxx-hesitant, or... -
Ukraine-related Disinformation Dataset
Ukraine-related disinformation dataset from "Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation" (accepted at SocInfo... -
Cross-Lingual Dataset of Crisis-Related Social Media
If you use this dataset, please cite the following paper: Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive... -
Angel efficient and effective node centric community discovery in static and ...
Community discovery is one of the most challenging tasks in social network analysis. During the last decades, several algorithms have been proposed with the aim of identifying... -
A theoretical model for pattern discovery in visual analytics
The word ‘pattern’ frequently appears in the visualisation and visual analytics literature, but what do we mean when we talk about patterns? We propose a practicable... -
Boilerplate Removal using a Neural Sequence Labeling Model
The extraction of main content from web pages is an important task for numerous applications, ranging from usability aspects, like reader views for news articles in web...