-
Multi-flow Composition in video streaming channels
This experiment is part the project "Streams of conspiratorial folklore" that investigates online media as a stream of performances, rather than as archives of documents. Our... -
Twitter Monitor
The Twitter Monitor is an interactive Web application designed to access the Twitter stream by exploiting the public Twitter Streaming APIs. The application can manage...-
HTML
The resource: 'Twitter Monitor URL' is not accessible as guest user. You must login to access it!
-
The resource: 'Twitter Monitor method' is not accessible as guest user. You must login to access it!
-
HTML
-
SMAPH Query Entity Linker
The SMAPH system links queries to the entities it mentions, disambiguating mentions if needed. Entities are Wikipedia pages. This problem is known as "entity recognition and...-
HTML
The resource: 'SMAPH documentation' is not accessible as guest user. You must login to access it!
-
HTML
-
The news tells us about peace - dashboard
Previous research demonstrates that the official Global Peace Index (GPI) can be captured at a higher frequency through GDELT, a digital news database. We have created a... -
Quantum Distance-Based Classifier
The Quantum Distance-Based Classifier is a technique inspired by the classical k-Nearest Neighbors that leverages quantum properties to perform prediction. -
Analysing Meme Collections with the Computer Vision Network Approach [Video T...
The video tutorial presents techniques for analysing meme collections with the computer vision network approach, including seven steps for network building and interpretation,... -
Making meme collections [Video Tutorial]
The video tutorial discusses making meme collections, including the meme as a technical collection of objects, automated visual analysis, and meme collection distinctiveness. -
Using Computer Vision Techniques to Study Images from the Web [Video Tutorial]
The video tutorial discusses using computer vision techniques to study online images, including case studies, research methods, challenges faced, and lessons learned. -
Multi-Task Faces (MTF) dataset
The Multi-Task Faces (MTF) dataset consists of cropped human faces for classification tasks or other research purposes. Each image in the dataset is labelled according to four...-
ZIP
The resource: 'MTF_dataset_20230701' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Boilernet
Deploys an artificial neural network to remove the boilerplate from HTML files. Annotates the text content in the file or extracts the text from the HTML file. -
The Italian Music Dataset
The dataset is built by exploiting the Spotify and SoundCloud APIs. It is composed of over 14,500 different songs of both famous and less famous Italian musicians. Each song...-
JSON
The resource: 'Dataset' is not accessible as guest user. You must login to access it!
-
JSON
-
German Academic Web
The dataset contains regular crawls of the websites for German academic institutions. -
GERDAQ Dataset
This is a benchmark dataset of annotated search-engine queries. Mentions of entities in search-engine queries are tagged with the entity they refer to. Wikipedia is used as...-
XML
The resource: 'GERDAQ dataset' is not accessible as guest user. You must login to access it!
-
XML
-
MSN Search query log
The data consists of an MSN Search query log excerpt with 15 million queries, from US users, sampled over one month of activity. Data attributes made available per query: 1)... -
ArchiveSpark
ArchiveSpark is an Apache Spark framework for easy data access, processing, extraction as well as derivation for Web archives and archival collections. It has a simple and... -
Wikipedia Word Embeddings
Embeddings were created through applying word2vec skipgram to a corpus of wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0... -
CoPhIR
The CoPhIR (Content-based Photo Image Retrieval) Test-Collection has been developed to make significant tests on the scalability of the SAPIR project infrastructure (SAPIR:... -
Product Reviews for Ordinal Quantification
This data set comprises a labeled training set, validation samples, and testing samples for ordinal quantification. It appears in our research paper "Ordinal Quantification... -
The Propagation of Misinformation in Social Media
There is growing awareness about how social media circulate extreme viewpoints and turn up the temperature of public debate. Posts that exhibit agitation garner...