A new AntTree-based algorithm for clustering short-text corpora
Short-text clustering is an important research area, given the current tendency for people to use "small-language" media such as blogs, text messaging and others. Recent work has proposed new bio-inspired clustering algorithms to deal with this diffic...
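| Main authors: | Errecalde, Marcelo Luis; Ingaramo, Diego Alejandro; Rosso, Paolo |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | 2010 |
| Subjects: | Computer Science; short-text clustering; bio-inspired algorithms; internal validity measures; silhouette coefficient |
| Online access: | http://sedici.unlp.edu.ar/handle/10915/9660 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr10-1.pdf |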
| Contributed by: | |

| id | I19-R120-10915-9660 |
|---|---|
| record_format | dspace |
| institution | Universidad Nacional de La Plata |
| institution_str | I-19 |
| repository_str | R-120 |
| collection | SEDICI (UNLP) |
| language | English |
| topic | Computer Science; short-text clustering; bio-inspired algorithms; internal validity measures; silhouette coefficient |
| spellingShingle | Computer Science short-text clustering bio-inspired algorithms internal validity measures silhouette coefficient Errecalde, Marcelo Luis Ingaramo, Diego Alejandro Rosso, Paolo A new AntTree-based algorithm for clustering short-text corpora |
| topic_facet | Computer Science; short-text clustering; bio-inspired algorithms; internal validity measures; silhouette coefficient |
| description | Short-text clustering is an important research area, given the current tendency for people to use "small-language" media such as blogs, text messaging and others. Recent work has proposed new bio-inspired clustering algorithms to deal with this difficult problem, and has also presented novel uses of Internal Clustering Validity Measures. This work proposes a new AntTree-based approach for the task. It integrates information on the Silhouette Coefficient and the concept of attraction of a cluster at different stages of the clustering process. The proposal achieves results comparable to the best reported in this area, shows stable result quality, and can also serve as a general improvement method for arbitrary clustering approaches. |
| format | Article |
| author | Errecalde, Marcelo Luis; Ingaramo, Diego Alejandro; Rosso, Paolo |
| author_facet | Errecalde, Marcelo Luis; Ingaramo, Diego Alejandro; Rosso, Paolo |
| author_sort | Errecalde, Marcelo Luis |
| title | A new AntTree-based algorithm for clustering short-text corpora |
| title_short | A new AntTree-based algorithm for clustering short-text corpora |
| title_full | A new AntTree-based algorithm for clustering short-text corpora |
| title_fullStr | A new AntTree-based algorithm for clustering short-text corpora |
| title_full_unstemmed | A new AntTree-based algorithm for clustering short-text corpora |
| title_sort | new anttree-based algorithm for clustering short-text corpora |
| publishDate | 2010 |
| url | http://sedici.unlp.edu.ar/handle/10915/9660 http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr10-1.pdf |
| work_keys_str_mv | AT errecaldemarceloluis anewanttreebasedalgorithmforclusteringshorttextcorpora AT ingaramodiegoalejandro anewanttreebasedalgorithmforclusteringshorttextcorpora AT rossopaolo anewanttreebasedalgorithmforclusteringshorttextcorpora AT errecaldemarceloluis newanttreebasedalgorithmforclusteringshorttextcorpora AT ingaramodiegoalejandro newanttreebasedalgorithmforclusteringshorttextcorpora AT rossopaolo newanttreebasedalgorithmforclusteringshorttextcorpora |
| bdutipo_str | Repositories |
| _version_ | 1764820492286427141 |