approved
VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter

We create a publicly available dataset of over 3,100 COVID-19 vaccine-related tweets labeled as one of four stance categories: pro-vaxx, anti-vaxx, vaxx-hesitant, or irrelevant. We split our dataset into two separate files: (1) VaccineHesitancy_train_v2.csv (Single + Double annotated) (2) VaccineHesitancy_test.csv (Double annotated) We present the details of this dataset here: VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter @misc{mu2023vaxxhesitancy, title={VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter}, author={Yida Mu and Mali Jin and Charlie Grimshaw and Carolina Scarton and Kalina Bontcheva and Xingyi Song}, year={2023}, eprint={2301.06660}, archivePrefix={arXiv}, primaryClass={cs.CL} }

Tags
Data and Resources
To access the resources you must log in
  • Zenodo Dataset Link

    The resource: 'Zenodo Dataset Link' is not accessible as guest user. You must login to access it!
Personal Data Attributes

Description: Personal Data related Information

Field Value
Anonymised No
ChildrenData No
Ethics Committee Approval Yes
General Data No
Personal Data No
Personal data was manifestly made public by the data subject No
Sensitive Data No
Additional Info
Field Value
Accessibility Virtual Access
Accessibility Mode Download
Availability On-Line
Basic rights Download
Creation Date 2023-01-13 00:00
Creator Yida Mu, y.mu@sheffield.ac.uk
Dataset Citation @misc{mu2023vaxxhesitancy, title={VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter}, author={Yida Mu and Mali Jin and Charlie Grimshaw and Carolina Scarton and Kalina Bontcheva and Xingyi Song}, year={2023}, eprint={2301.06660}, archivePrefix={arXiv}, primaryClass={cs.CL}}
Dataset Re-Use Safeguards FAIR ecosystem
External Identifier https://doi.org/10.5281/zenodo.7601328
Field/Scope of use Research only
Group Others
Manifestation Type Virtual
Processing Degree Primary
Sublicense rights No
Territory of use World Wide
Thematic Cluster Text and Social Media Mining [TSMM]
system:type Dataset
Management Info
Field Value
Author Yida Mu
Maintainer Yida Mu
Version 1
Last Updated 17 June 2023, 08:23 (CEST)
Created 20 February 2023, 16:55 (CET)