10 items found

Organisations: SoBigData Services and Products Availability: On-Site Groups: sobigdata-eu Societal Debates and Misinformation

Filter Results
  • Dataset

    ClueWeb09

    The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on...
  • Dataset

    Twitter fake followers

    Fake followers are fake accounts massively created to follow a target account and that can be bought from online markets. In other words, their goal is that of increasing the...
  • Dataset

    Twitter social bots

    Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,...
  • Dataset

    Twitter Dumps

    The dataset consists of the 10% of the daily stream of tweets produced on Twitter filtered into 3 subsets: English, Italian, geo-referenced. The tweets are a random sample of...
  • Dataset

    Brexit Twitter User Vote Intent

    A list of users for which vote intent in the UK EU membership referendum has been established.
  • Dataset

    UK General Election Vote Intent

    A list of Twitter users for whom party political allegiance/vote intent has been established.
  • Dataset

    .ee Web archive

    .ee Web archive consisting of snapshots from 2015
  • Dataset

    Twitter Dataset 2013-2014

    The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st...
  • Dataset

    Articles and comments of major Estonian newspapers

    The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016.
  • Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...