52 items found

Types: Dataset Groups: Societal Debates and Misinformation

Filter Results
  • Dataset

    Social Network dataset - LiveJournal

    LiveJournal is a free on-line blogging community where users declare friendship each other. LiveJournal also allows users form a group which other members can then join. We...
    • HTML
      The resource: 'LiveJournal social network ...' is not accessible as guest user. You must login to access it!
  • Dataset

    .ee Web archive

    .ee Web archive consisting of snapshots from 2015
  • Dataset

    Broad Twitter Corpus

    The Broad Twitter Corpus is a named entity-annotated dataset of tweets, collected in order to capture temporal, spatial and social diversity. The goal of the corpus is to...
    • JSON
      The resource: 'Broad Twitter Corpus' is not accessible as guest user. You must login to access it!
  • Dataset

    UK election abuse data

    The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...
    • XLS
      The resource: 'uk-election-abuse.tar.gz' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter Dataset 2013-2014

    The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st...
  • Dataset

    Articles and comments of major Estonian newspapers

    The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016.
  • Dataset

    ClueWeb12

    The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information...
  • Dataset

    Sheffield NERD Tweet Corpus

    The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...
    • FINF
      The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Covid infodemic in Italy -- Most retweeted accounts

    Top 10 most retweeted accounts on Covid-related keywords, between Jan 30 and Mar 20, 2020.
    • ZIP
      The resource: 'dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter Dataset British MPs

    This dataset contains the Twitter tweet_ids from the Timelines of 584 members of British Parliament (collected between 4th and 6th of March 2022). The users are identified from...
    • TSV
      The resource: 'Twitter Dataset British MPs' is not accessible as guest user. You must login to access it!
  • Dataset

    A dataset of journalists on Twitter

    This dataset comprises the Twitter timelines of journalists belonging to 17 different countries from 8 different continental regions, downloaded in May 2018. We used the Twitter...
    • HTML
      The resource: 'Journalists dataset' is not accessible as guest user. You must login to access it!
You can also access this registry using the API (see API Docs).