DyClee-C: a clustering algorithm for categorical data based diagnosis - LAAS - Laboratoire d'Analyse et d'Architecture des Systèmes Accéder directement au contenu
Communication Dans Un Congrès Année : 2019

DyClee-C: a clustering algorithm for categorical data based diagnosis

Résumé

In data-based diagnostic applications, large amounts of data are often available but the data re- mains unlabelled because labelling would require too much time and imply prohibitive costs. The different situations, e.g. normal or faulty, must hence be learned from the data. Clustering methods, also qualified as unsupervised classification methods, can then be used to create groups of samples according to some sim- ilarity criterion. The different groups can sup- posedly be associated to different situations. Nu- merous algorithms have been developed in recent years for clustering numeric data but these meth- ods are not applicable to categorical data. How- ever, in many application domains, categorical features are key to properly describe the differ- ent situations. This paper presents DyClee-C, an extension of the numeric feature based DyClee al- gorithm to categorical data. DyClee-C is applied to two data sets: a soybean data set to diagnose the disease soybean plants and a breast cancer data set to assess the current diagnosis in terms of recur- rence events and prognose possible relapse.
Fichier principal
Vignette du fichier
DyClee_C_DX19_Final.pdf (708.95 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02383492 , version 1 (27-11-2019)

Identifiants

  • HAL Id : hal-02383492 , version 1

Citer

Tom Obry, Louise Travé-Massuyès, Audine Subias. DyClee-C: a clustering algorithm for categorical data based diagnosis. DX'19 – 30th International Workshop on Principles of Diagnosis, Nov 2019, Klagenfurt, Austria. ⟨hal-02383492⟩
54 Consultations
8 Téléchargements

Partager

Gmail Facebook X LinkedIn More