A mutual information estimator for continuous and discrete variables applied to feature selection and classification problems
Carregando...
Data
Título da Revista
ISSN da Revista
Título de Volume
Editor
Universidade Federal de Minas Gerais
Descrição
Tipo
Artigo de periódico
Título alternativo
Primeiro orientador
Membros da banca
Resumo
Currently Mutual Information has been widely used in pattern recognition and feature selection problems. It may be used as a measure of redundancy between features as well as a measure of dependency evaluating the relevance of each feature. Since marginal densities of real datasets are not usually known in advance, mutual information should be evaluated by estimation. There are mutual information estimators in the literature that were specifically designed for continuous or for discrete variables, however, most real problems are composed by a mixture of both. There is, of course, some implicit loss of information when using one of them to deal with mixed continuous and discrete variables. This paper presents a new estimator that is able to deal with mixed set of variables. It is shown in experiments with synthetic and real datasets that the method yields reliable results in such circumstance.
Abstract
Assunto
Inteligência artificial, Análise numérica - Processamento de dados
Palavras-chave
inteligência computacional, Artificial Intelligence, Semissupervisionado, Mutual Information, Feature Selection, information estimators that were specifically designed for continuous and for discrete variables
Citação
Departamento
Curso
Endereço externo
https://link.springer.com/article/10.1080/18756891.2016.1204120