Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation
We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-str...
Main authors: | Navajas, Joaquín; Furman, Damián Ariel; Junqueras, Juan; Gümüslü, Burçe; Deroy, Ophelia; Sulik, Justin |
---|---|
Format: | info:eu-repo/semantics/preprint acceptedVersion |
Language: | Spanish |
Published: | Universidad Torcuato Di Tella, 2024 |
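Subjects: | Predicción tecnológica; Technological prediction; Inteligencia Artificial; Artificial Intelligence; Data Analysis; Análisis de datos; Reasons For and Against Vaccination (RFAV); Computation and Language; Data mining |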
Online access: | https://repositorio.utdt.edu/handle/20.500.13098/12857 https://doi.org/10.48550/arXiv.2406.19951 |
Contributed by: |
id |
I57-R163-20.500.13098-12857 |
---|---|
record_format |
dspace |
spelling |
I57-R163-20.500.13098-128572024-07-05T07:00:18Z Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation Navajas, Joaquín Furman, Damián Ariel Junqueras, Juan Gümüslü, Burçe Deroy, Ophelia Sulik, Justin Predicción tecnológica Technological prediction Inteligencia Artificial Artificial Intelligence Data Analysis Análisis de datos Reasons For and Against Vaccination (RFAV) Computation and Language Data mining We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-structured text, under different task definitions, despite the high level of subjectivity involved and explore the impact of artificially augmented data using in-context learning with GPT4 and GPT3.5-Turbo. We publish the dataset and the trained models along with the annotation manual used to train annotators and define the task. 2024-07-04T15:29:58Z 2024-07-04T15:29:58Z 2024-06-28 info:eu-repo/semantics/preprint info:eu-repo/semantics/acceptedVersion https://repositorio.utdt.edu/handle/20.500.13098/12857 https://doi.org/10.48550/arXiv.2406.19951 spa info:eu-repo/semantics/openAccess https://creativecommons.org/licenses/by-sa/2.5/ar/ 19 p. application/pdf application/pdf Universidad Torcuato Di Tella |
institution |
Universidad Torcuato Di Tella |
institution_str |
I-57 |
repository_str |
R-163 |
collection |
Repositorio Digital Universidad Torcuato Di Tella |
language |
Spanish |
orig_language_str_mv |
spa |
topic |
Predicción tecnológica; Technological prediction; Inteligencia Artificial; Artificial Intelligence; Data Analysis; Análisis de datos; Reasons For and Against Vaccination (RFAV); Computation and Language; Data mining |
description |
We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and the scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show that these reasons can be mined from unstructured text under different task definitions, despite the high level of subjectivity involved, and we explore the impact of data augmented artificially through in-context learning with GPT4 and GPT3.5-Turbo. We publish the dataset and the trained models, along with the annotation manual used to train annotators and define the task. |
format |
info:eu-repo/semantics/preprint acceptedVersion |
author |
Navajas, Joaquín; Furman, Damián Ariel; Junqueras, Juan; Gümüslü, Burçe; Deroy, Ophelia; Sulik, Justin |
title |
Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation |
publisher |
Universidad Torcuato Di Tella |
publishDate |
2024 |
url |
https://repositorio.utdt.edu/handle/20.500.13098/12857 https://doi.org/10.48550/arXiv.2406.19951 |