Keyword identification in Spanish documents using neural networks
The large amount of textual information digitally available today gives rise to the need for effective means of indexing, searching and retrieving this information. Keywords are used to describe briefly and precisely the contents of a textual document. In this paper we present an algorithm for keywo...
Guardado en:
Autores principales: | , |
---|---|
Formato: | Articulo |
Lenguaje: | Inglés |
Publicado: |
2015
|
Materias: | |
Acceso en línea: | http://sedici.unlp.edu.ar/handle/10915/50087 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST41-Paper-2.pdf |
Aporte de: |
id |
I19-R120-10915-50087 |
---|---|
record_format |
dspace |
institution |
Universidad Nacional de La Plata |
institution_str |
I-19 |
repository_str |
R-120 |
collection |
SEDICI (UNLP) |
language |
Inglés |
topic |
Ciencias Informáticas keyword extraction autoencoders Neural nets Redes Neurales (Computación) |
spellingShingle |
Ciencias Informáticas keyword extraction autoencoders Neural nets Redes Neurales (Computación) Aquino, Germán Osvaldo Lanzarini, Laura Cristina Keyword identification in Spanish documents using neural networks |
topic_facet |
Ciencias Informáticas keyword extraction autoencoders Neural nets Redes Neurales (Computación) |
description |
The large amount of textual information digitally available today gives rise to the need for effective means of indexing, searching and retrieving this information. Keywords are used to describe briefly and precisely the contents of a textual document. In this paper we present an algorithm for keyword extraction from documents written in Spanish.This algorithm combines autoencoders, which are adequate for highly unbalanced classification problems, with the discriminative power of conventional binary classifiers. In order to improve its performance on larger and more diverse datasets, our algorithm trains several models of each kind through bagging. |
format |
Articulo Articulo |
author |
Aquino, Germán Osvaldo Lanzarini, Laura Cristina |
author_facet |
Aquino, Germán Osvaldo Lanzarini, Laura Cristina |
author_sort |
Aquino, Germán Osvaldo |
title |
Keyword identification in Spanish documents using neural networks |
title_short |
Keyword identification in Spanish documents using neural networks |
title_full |
Keyword identification in Spanish documents using neural networks |
title_fullStr |
Keyword identification in Spanish documents using neural networks |
title_full_unstemmed |
Keyword identification in Spanish documents using neural networks |
title_sort |
keyword identification in spanish documents using neural networks |
publishDate |
2015 |
url |
http://sedici.unlp.edu.ar/handle/10915/50087 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST41-Paper-2.pdf |
work_keys_str_mv |
AT aquinogermanosvaldo keywordidentificationinspanishdocumentsusingneuralnetworks AT lanzarinilauracristina keywordidentificationinspanishdocumentsusingneuralnetworks |
bdutipo_str |
Repositorios |
_version_ |
1764820475473559552 |