13 items found

Organisations: SoBigData Services and Products
Availability: On-Site
Groups: Societal Debates and Misinformation

  • Dataset

    FAIR-Edu: Gender Bias in Academic Promotion Dataset

    Pseudonymized dataset of publications by researchers and professors in the Italian Informatics Community (Computer Science + Computer Engineering). Each sample is...
  • Dataset

    ClueWeb09

    The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on...
  • Dataset

    Twitter fake followers

    Fake followers are fake accounts created in bulk to follow a target account; they can be bought from online markets. In other words, their goal is that of increasing the...
  • Dataset

    Twitter social bots

    Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
  • Dataset

    Twitter Dumps

    The dataset consists of 10% of the daily stream of tweets produced on Twitter, filtered into 3 subsets: English, Italian, and geo-referenced. The tweets are a random sample of...
  • Dataset

    Brexit Twitter User Vote Intent

    A list of users for whom vote intent in the UK EU membership referendum has been established.
  • Dataset

    UK General Election Vote Intent

    A list of Twitter users for whom party political allegiance/vote intent has been established.
  • Dataset

    .ee Web archive

    An archive of the .ee Web consisting of snapshots from 2015.
  • Dataset

    Twitter Dataset 2013-2014

    The dataset was collected by the Archive team through the Twitter Streaming API, which provides free access to 1% of public tweets. The covered time period is from January 1st...
  • Dataset

    Articles and comments of major Estonian newspapers

    The dataset contains articles and comments from four major Estonian news portals, from the early 2000s to 2016.
  • Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • Method

    Digital DNA fingerprinting

    "Digital DNA fingerprinting" is a spambot detection technique based on the "Digital DNA" online behavioral modeling technique. Given a set of Twitter user timelines, it is...
  • Method

    SCube

    This resource is a method for discovering segregation in both tabular and social network data.
You can also access this registry using the API (see API Docs).