An instantiation for sequences of hierarchical distance-based conceptual clustering

In this work, we present an instantiation of our framework for Hierarchical Distance-based Conceptual Clustering (HDCC) using sequences, a particular kind of structured data. We analyze the relationship between distances and generalization operators for sequences in the context of HDCC. HDCC is a ge...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Funes, Ana, Ramírez-Quintana, María José, Hernández-Orallo, Jose, Ferri, Cèsar
Formato: Objeto de conferencia
Lenguaje:Inglés
Publicado: 2011
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/125251
Aporte de:
id I19-R120-10915-125251
record_format dspace
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language Inglés
topic Ciencias Informáticas
conceptual clustering
distance based clustering
Linked lists
sequences
edit distance
spellingShingle Ciencias Informáticas
conceptual clustering
distance based clustering
Linked lists
sequences
edit distance
Funes, Ana
Ramírez-Quintana, María José
Hernández-Orallo, Jose
Ferri, Cèsar
An instantiation for sequences of hierarchical distance-based conceptual clustering
topic_facet Ciencias Informáticas
conceptual clustering
distance based clustering
Linked lists
sequences
edit distance
description In this work, we present an instantiation of our framework for Hierarchical Distance-based Conceptual Clustering (HDCC) using sequences, a particular kind of structured data. We analyze the relationship between distances and generalization operators for sequences in the context of HDCC. HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalizations of the discovered clusters. Since the approach is general, it allows combining the flexibility of changing distances for different data types at the same time that we take advantage of the interpretability offered by the obtained concepts, which is central for descriptive data mining tasks. We propose here different generalization operators for sequences and analyze how they work together with the edit and linkage distances in HDCC. This analysis is carried out based on three different properties for generalization operators and three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalization operators.
format Objeto de conferencia
Objeto de conferencia
author Funes, Ana
Ramírez-Quintana, María José
Hernández-Orallo, Jose
Ferri, Cèsar
author_facet Funes, Ana
Ramírez-Quintana, María José
Hernández-Orallo, Jose
Ferri, Cèsar
author_sort Funes, Ana
title An instantiation for sequences of hierarchical distance-based conceptual clustering
title_short An instantiation for sequences of hierarchical distance-based conceptual clustering
title_full An instantiation for sequences of hierarchical distance-based conceptual clustering
title_fullStr An instantiation for sequences of hierarchical distance-based conceptual clustering
title_full_unstemmed An instantiation for sequences of hierarchical distance-based conceptual clustering
title_sort instantiation for sequences of hierarchical distance-based conceptual clustering
publishDate 2011
url http://sedici.unlp.edu.ar/handle/10915/125251
work_keys_str_mv AT funesana aninstantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT ramirezquintanamariajose aninstantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT hernandezorallojose aninstantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT ferricesar aninstantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT funesana instantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT ramirezquintanamariajose instantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT hernandezorallojose instantiationforsequencesofhierarchicaldistancebasedconceptualclustering
AT ferricesar instantiationforsequencesofhierarchicaldistancebasedconceptualclustering
bdutipo_str Repositorios
_version_ 1764820451447537666