A mutual information estimator for continuous and discrete variables applied to feature selection and classification problems

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Universidade Federal de Minas Gerais

Description

Type

Journal article

Alternative title

First advisor

Committee members

Summary

Mutual information is widely used in pattern recognition and feature selection problems. It can serve both as a measure of redundancy between features and as a measure of dependency for evaluating the relevance of each feature. Since the marginal densities of real datasets are usually not known in advance, mutual information must be estimated. The literature offers mutual information estimators designed specifically for continuous or for discrete variables; however, most real problems involve a mixture of both. Using one of these estimators on mixed continuous and discrete variables entails some implicit loss of information. This paper presents a new estimator that can handle a mixed set of variables. Experiments with synthetic and real datasets show that the method yields reliable results in such circumstances.
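
As a rough illustration of the estimation problem described here, and not the estimator proposed in the paper, the sketch below computes a simple plug-in (histogram) estimate of the mutual information between a continuous feature and a discrete class label. The function name `binned_mutual_information`, the equal-width binning, and the default of 10 bins are assumptions made for this example; binning the continuous variable is exactly the kind of step that can discard information in a mixed setting.

```python
import numpy as np


def binned_mutual_information(x, y, n_bins=10):
    """Plug-in estimate of I(X; Y) in nats for continuous x and discrete y.

    Illustrative only: x is discretized into equal-width bins (an assumption
    of this sketch), then MI is computed from the empirical joint distribution.
    """
    x = np.asarray(x, dtype=float).ravel()
    y = np.asarray(y).ravel()

    # Discretize the continuous variable; indices fall in 0..n_bins-1.
    edges = np.histogram_bin_edges(x, bins=n_bins)
    x_bin = np.digitize(x, edges[1:-1])

    # Map arbitrary class labels to integer codes 0..k-1.
    _, y_code = np.unique(y, return_inverse=True)

    # Empirical joint distribution over (bin, class).
    joint = np.zeros((n_bins, y_code.max() + 1))
    np.add.at(joint, (x_bin, y_code), 1)
    p_xy = joint / joint.sum()

    # Marginals and the product distribution p(x) p(y).
    p_x = p_xy.sum(axis=1, keepdims=True)
    p_y = p_xy.sum(axis=0, keepdims=True)
    product = p_x @ p_y

    # Sum only over cells with non-zero joint probability.
    nz = p_xy > 0
    return float(np.sum(p_xy[nz] * np.log(p_xy[nz] / product[nz])))


# Synthetic check: one feature depends on the class, the other is pure noise.
rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=2000)
x_relevant = rng.normal(loc=3.0 * y, scale=1.0)
x_noise = rng.normal(size=2000)

mi_relevant = binned_mutual_information(x_relevant, y)
mi_noise = binned_mutual_information(x_noise, y)
print(f"relevant feature: {mi_relevant:.3f} nats, noise feature: {mi_noise:.3f} nats")
```

In a feature selection setting, scores of this kind would be used to rank features by relevance to the class label. The result depends on the number of bins, which is one reason estimators that treat continuous variables directly are of interest.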

Abstract

Subject

Artificial intelligence, Numerical analysis - Data processing

Keywords

computational intelligence, Artificial Intelligence, Semi-supervised, Mutual Information, Feature Selection, information estimators that were specifically designed for continuous and for discrete variables

Citation

Course

External address

https://link.springer.com/article/10.1080/18756891.2016.1204120

Evaluation

Review

Supplemented By

Referenced By