Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation

We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-str...

Descripción completa

Detalles Bibliográficos
Autores principales: Navajas, Joaquín, Furman, Damián Ariel, Junqueras, Juan, Gümüslü, Burçe, Deroy, Ophelia, Sulik, Justin
Formato: info:eu-repo/semantics/preprint acceptedVersion
Lenguaje:Español
Publicado: Universidad Torcuato Di Tella 2024
Materias:
Acceso en línea:https://repositorio.utdt.edu/handle/20.500.13098/12857
https://doi.org/10.48550/arXiv.2406.19951
Aporte de:
id I57-R163-20.500.13098-12857
record_format dspace
spelling I57-R163-20.500.13098-128572024-07-05T07:00:18Z Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation Navajas, Joaquín Furman, Damián Ariel Junqueras, Juan Gümüslü, Burçe Deroy, Ophelia Sulik, Justin Predicción tecnológica Technological prediction Inteligencia Artificial Artificial Intelligence Data Analysis Análisis de datos Reasons For and Against Vaccination (RFAV) Computation and Language Data mining We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-structured text, under different task definitions, despite the high level of subjectivity involved and explore the impact of artificially augmented data using in-context learning with GPT4 and GPT3.5-Turbo. We publish the dataset and the trained models along with the annotation manual used to train annotators and define the task. 2024-07-04T15:29:58Z 2024-07-04T15:29:58Z 2024-06-28 info:eu-repo/semantics/preprint info:eu-repo/semantics/acceptedVersion https://repositorio.utdt.edu/handle/20.500.13098/12857 https://doi.org/10.48550/arXiv.2406.19951 spa info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-sa/2.5/ar/ 19 p. application/pdf application/pdf Universidad Torcuato Di Tella
institution Universidad Torcuato Di Tella
institution_str I-57
repository_str R-163
collection Repositorio Digital Universidad Torcuato Di Tella
language Español
orig_language_str_mv spa
topic Predicción tecnológica
Technological prediction
Inteligencia Artificial
Artificial Intelligence
Data Analysis
Análisis de datos
Reasons For and Against Vaccination (RFAV)
Computation and Language
Data mining
spellingShingle Predicción tecnológica
Technological prediction
Inteligencia Artificial
Artificial Intelligence
Data Analysis
Análisis de datos
Reasons For and Against Vaccination (RFAV)
Computation and Language
Data mining
Navajas, Joaquín
Furman, Damián Ariel
Junqueras, Juan
Gümüslü, Burçe
Deroy, Ophelia
Sulik, Justin
Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
topic_facet Predicción tecnológica
Technological prediction
Inteligencia Artificial
Artificial Intelligence
Data Analysis
Análisis de datos
Reasons For and Against Vaccination (RFAV)
Computation and Language
Data mining
description We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-structured text, under different task definitions, despite the high level of subjectivity involved and explore the impact of artificially augmented data using in-context learning with GPT4 and GPT3.5-Turbo. We publish the dataset and the trained models along with the annotation manual used to train annotators and define the task.
format info:eu-repo/semantics/preprint
acceptedVersion
author Navajas, Joaquín
Furman, Damián Ariel
Junqueras, Juan
Gümüslü, Burçe
Deroy, Ophelia
Sulik, Justin
author_facet Navajas, Joaquín
Furman, Damián Ariel
Junqueras, Juan
Gümüslü, Burçe
Deroy, Ophelia
Sulik, Justin
author_sort Navajas, Joaquín
title Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
title_short Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
title_full Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
title_fullStr Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
title_full_unstemmed Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
title_sort mining reasons for and against vaccination from unstructured data using nichesourcing and ai data augmentation
publisher Universidad Torcuato Di Tella
publishDate 2024
url https://repositorio.utdt.edu/handle/20.500.13098/12857
https://doi.org/10.48550/arXiv.2406.19951
work_keys_str_mv AT navajasjoaquin miningreasonsforandagainstvaccinationfromunstructureddatausingnichesourcingandaidataaugmentation
AT furmandamianariel miningreasonsforandagainstvaccinationfromunstructureddatausingnichesourcingandaidataaugmentation
AT junquerasjuan miningreasonsforandagainstvaccinationfromunstructureddatausingnichesourcingandaidataaugmentation
AT gumusluburce miningreasonsforandagainstvaccinationfromunstructureddatausingnichesourcingandaidataaugmentation
AT deroyophelia miningreasonsforandagainstvaccinationfromunstructureddatausingnichesourcingandaidataaugmentation
AT sulikjustin miningreasonsforandagainstvaccinationfromunstructureddatausingnichesourcingandaidataaugmentation
_version_ 1808040582234243072