A self-generating prototype method based on information entropy used for condensing data in classification tasks

Tipo
Artigo de evento
Data de publicação
2019
Periódico
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Citações (Scopus)
0
Autores
Manastarla A.
Silva L.A.
Orientador
Título da Revista
ISSN da Revista
Título de Volume
Membros da banca
Programa
Resumo
© 2019, Springer Nature Switzerland AG.This paper presents a new self-generating prototype method based on information entropy to reduce the size of training datasets. The method accelerates the classifier training time without significantly decreasing the quality in the data classification task. The effectiveness of the proposed method is compared to the K-nearest neighbour classifier (kNN) and the genetic algorithm prototype selection (GA). kNN is a benchmark method used for data classification tasks, while GA is a prototype selection method that provides competitive optimisation of accuracy and the data reduction ratio. Considering thirty different public datasets, the results of the comparisons demonstrate that the proposed method outperforms kNN when using the original training set as well as the reduced training set obtained via GA prototype selection.
Descrição
Palavras-chave
Assuntos Scopus
Classification tasks , Classifier training , Data classification , Information entropy , K-nearest neighbours , Prototype selection , Reduced training sets , Training data sets
Citação
DOI (Texto completo)