Cross domain author profiling task in spanish language: an experimental study
Author Profiling is the task of predicting characteristics of the author of a text, such as age, gender, personality, native language, etc. This is a task of growing importance due to the potential applications in security, crime detection and marketing, among others. An interesting point is to stud...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | Articulo |
Lenguaje: | Inglés |
Publicado: |
2015
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/50187 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST41-Paper-12.pdf |
Aporte de: |
id |
I19-R120-10915-50187 |
---|---|
record_format |
dspace |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas Natural Language Processing Data mining text mining cross domain classification |
spellingShingle |
Ciencias Informáticas Natural Language Processing Data mining text mining cross domain classification Garciarena Ucelay, María José Villegas, María Paula Cagnina, Leticia Errecalde, Marcelo Luis Cross domain author profiling task in spanish language: an experimental study |
topic_facet |
Ciencias Informáticas Natural Language Processing Data mining text mining cross domain classification |
description |
Author Profiling is the task of predicting characteristics of the author of a text, such as age, gender, personality, native language, etc. This is a task of growing importance due to the potential applications in security, crime detection and marketing, among others. An interesting point is to study the robustness of a classifier when it is trained with a data set and tested with others containing different characteristics. Commonly this is called cross domain experimentation.
Although different cross domain studies have been done for data sets in English language, for Spanish it has recently begun. In this context, this work presents a study of cross domain classification for the author profiling task in Spanish. The experimental results showed that using corpora with different levels of formality we can obtain robust classifiers for the author profiling task in Spanish language. |
format |
Articulo Articulo |
author |
Garciarena Ucelay, María José Villegas, María Paula Cagnina, Leticia Errecalde, Marcelo Luis |
author_facet |
Garciarena Ucelay, María José Villegas, María Paula Cagnina, Leticia Errecalde, Marcelo Luis |
author_sort |
Garciarena Ucelay, María José |
title |
Cross domain author profiling task in spanish language: an experimental study |
title_short |
Cross domain author profiling task in spanish language: an experimental study |
title_full |
Cross domain author profiling task in spanish language: an experimental study |
title_fullStr |
Cross domain author profiling task in spanish language: an experimental study |
title_full_unstemmed |
Cross domain author profiling task in spanish language: an experimental study |
title_sort |
cross domain author profiling task in spanish language: an experimental study |
publishDate |
2015 |
url |
http://sedici.unlp.edu.ar/handle/10915/50187 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST41-Paper-12.pdf |
work_keys_str_mv |
AT garciarenaucelaymariajose crossdomainauthorprofilingtaskinspanishlanguageanexperimentalstudy AT villegasmariapaula crossdomainauthorprofilingtaskinspanishlanguageanexperimentalstudy AT cagninaleticia crossdomainauthorprofilingtaskinspanishlanguageanexperimentalstudy AT errecaldemarceloluis crossdomainauthorprofilingtaskinspanishlanguageanexperimentalstudy |
bdutipo_str |
Repositorios |
_version_ |
1764820475562688512 |