Published May 9, 2023 | Version v1
Publication

SNN: A Supervised Clustering Algorithm

Description

In this paper, we present a new algorithm, based on the nearest neighbours method, for discovering groups and identifying interesting distributions in the underlying data of labelled databases. We introduce the theory of nearest neighbours sets as the basis for the algorithm S-NN (Similar Nearest Neighbours). Traditional clustering algorithms are very sensitive to user-defined parameters, and expert knowledge is required to choose their values. Frequently, these algorithms are fragile in the presence of outliers and only adjust well to spherical shapes. Experiments have shown that S-NN is accurate at discovering clusters of arbitrary shape and density, since it takes into account the internal features of each cluster and does not depend on a user-supplied static model. S-NN achieves this by collecting, for each example, the nearest neighbours with the same label until an enemy (a neighbour with a different label) is found. Its determinism and the results it offers to the researcher make it a valuable tool for representing the knowledge inherent in labelled databases.
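The neighbour-collection step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `same_label_neighbours`, the Euclidean distance, and the toy data are assumptions made for illustration only. The abstract does not say how the resulting neighbour sets are combined into clusters, so the sketch stops at building the sets.

```python
import math

def same_label_neighbours(points, labels, i):
    """Collect the neighbours of point i, nearest first, until the first enemy.

    Hypothetical helper: the name and the Euclidean metric are assumptions,
    not taken from the paper.
    """
    # Order every other point by its distance to point i.
    order = sorted(
        (j for j in range(len(points)) if j != i),
        key=lambda j: math.dist(points[i], points[j]),
    )
    collected = []
    for j in order:
        if labels[j] != labels[i]:
            break  # first enemy (different label) reached: stop collecting
        collected.append(j)
    return collected

# Toy labelled data on a line (assumed for illustration).
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (5.0, 0.0), (6.0, 0.0)]
labels = ["a", "a", "a", "b", "b"]

neighbour_sets = [same_label_neighbours(points, labels, i) for i in range(len(points))]
```

No radius or neighbour count is supplied here: the size of each set is determined by the data alone, which matches the abstract's point that the method does not rely on user-defined parameters.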

Abstract

Comisión Interministerial de Ciencia y Tecnología (CICYT) TIC99-0351

Additional details

Identifiers

URL
https://idus.us.es/handle//11441/145698
URN
urn:oai:idus.us.es:11441/145698

Origin repository

Origin repository
USE