Semi-supervised relevance index for feature selection

dc.creator: Frederico Gualberto Ferreira Coelho
dc.creator: Cristiano Leite de Castro
dc.creator: Antônio Braga
dc.creator: Michel Verleysen
dc.date.accessioned: 2025-04-07T13:35:39Z
dc.date.accessioned: 2025-09-09T00:13:25Z
dc.date.available: 2025-04-07T13:35:39Z
dc.date.issued: 2017
dc.identifier.doi: 10.1007/s00521-017-3062-0
dc.identifier.issn: 1433-3058
dc.identifier.uri: https://hdl.handle.net/1843/81329
dc.language: eng
dc.publisher: Universidade Federal de Minas Gerais
dc.relation.ispartof: Neural computing and applications
dc.rights: Restricted access
dc.subject: Indices
dc.subject: Information sources
dc.subject.other: This work presented a new method for feature selection that is capable of considering sources of information from labeled as well as from unlabeled data. In this semi-supervised feature selection framework, the method is based on the idea of eliminating redundancy by feature clustering.
dc.subject.other: The method adopts a novel semi-supervised approach, since labeled and unlabeled data are taken into account in the new similarity index, which is also proposed in this work. SSFC can be directly applied to multiple variables by incorporating them into the MI estimation.
dc.subject.other: The stopping criterion for feature clustering can also incorporate further overall performance strategies, since it is based only on the significance level of S. The method, however, achieved competitive results with fewer features than previous works in the literature on the same data sets. It is interesting to highlight that the proposed method performed well even with a small amount of labeled data.
dc.title: Semi-supervised relevance index for feature selection
dc.type: Journal article
local.citation.epage: 9
local.citation.spage: 1
local.citation.volume: 28
local.description.resumo: This paper presents a new relevance index, based on mutual information, that uses both labeled and unlabeled data. The proposed index takes into account the similarity between features and their joint influence on the output variable. Based on this principle, a feature selection method is developed that eliminates redundant and irrelevant features when the relevance index value is less than a threshold value. A strategy to set the threshold is also proposed in this work. Experiments show that the new method is capable of capturing important joint relations between input and output variables, which are incorporated into a new feature selection clustering approach.
local.publisher.country: Brazil
local.publisher.department: ENG - Department of Electrical Engineering
local.publisher.department: ENG - Department of Electronic Engineering
local.publisher.initials: UFMG
local.url.externa: https://link.springer.com/article/10.1007/s00521-017-3062-0
