A new AntTree-based algorithm for clustering short-text corpora

Research work on "short-text clustering" is a very important research area due to the current tendency for people to use "small-language", e.g. blogs, textmessaging and others. In some recent works, new bioinspired clustering algorithms have been proposed to deal with this diffic...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Errecalde, Marcelo Luis, Ingaramo, Diego Alejandro, Rosso, Paolo
Formato: Articulo
Lenguaje:Inglés
Publicado: 2010
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/9660
http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr10-1.pdf
Aporte de:
id I19-R120-10915-9660
record_format dspace
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language Inglés
topic Ciencias Informáticas
short-text clustering
bio-inspired algorithms
internal validity measures
silhouette coefficient
spellingShingle Ciencias Informáticas
short-text clustering
bio-inspired algorithms
internal validity measures
silhouette coefficient
Errecalde, Marcelo Luis
Ingaramo, Diego Alejandro
Rosso, Paolo
A new AntTree-based algorithm for clustering short-text corpora
topic_facet Ciencias Informáticas
short-text clustering
bio-inspired algorithms
internal validity measures
silhouette coefficient
description Research work on "short-text clustering" is a very important research area due to the current tendency for people to use "small-language", e.g. blogs, textmessaging and others. In some recent works, new bioinspired clustering algorithms have been proposed to deal with this difficult problem and novel uses of Internal Clustering Validity Measures have also been presented. In this work, a new AntTree-based approach is proposed for this task. It integrates information on the Silhouette Coefficient and the concept of attraction of a cluster in different stages of the clustering process. The proposal achieves results comparable to the best reported results in this area, showing an interesting stability in the quality of the results and presenting some interesting capabilities as a general improvement method for arbitrary clustering approaches.
format Articulo
Articulo
author Errecalde, Marcelo Luis
Ingaramo, Diego Alejandro
Rosso, Paolo
author_facet Errecalde, Marcelo Luis
Ingaramo, Diego Alejandro
Rosso, Paolo
author_sort Errecalde, Marcelo Luis
title A new AntTree-based algorithm for clustering short-text corpora
title_short A new AntTree-based algorithm for clustering short-text corpora
title_full A new AntTree-based algorithm for clustering short-text corpora
title_fullStr A new AntTree-based algorithm for clustering short-text corpora
title_full_unstemmed A new AntTree-based algorithm for clustering short-text corpora
title_sort new anttree-based algorithm for clustering short-text corpora
publishDate 2010
url http://sedici.unlp.edu.ar/handle/10915/9660
http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr10-1.pdf
work_keys_str_mv AT errecaldemarceloluis anewanttreebasedalgorithmforclusteringshorttextcorpora
AT ingaramodiegoalejandro anewanttreebasedalgorithmforclusteringshorttextcorpora
AT rossopaolo anewanttreebasedalgorithmforclusteringshorttextcorpora
AT errecaldemarceloluis newanttreebasedalgorithmforclusteringshorttextcorpora
AT ingaramodiegoalejandro newanttreebasedalgorithmforclusteringshorttextcorpora
AT rossopaolo newanttreebasedalgorithmforclusteringshorttextcorpora
bdutipo_str Repositorios
_version_ 1764820492286427141