Explorando limitações das redes convolucionais na seleção de testes binários

Bernardo Janko Gonçalves Biesseck

Please use this identifier to cite or link to this item: http://hdl.handle.net/1843/ESBF-B8VGJG

Type:	Dissertação de Mestrado
Title:	Explorando limitações das redes convolucionais na seleção de testes binários
Authors:	Bernardo Janko Gonçalves Biesseck
First Advisor:	Erickson Rangel do Nascimento
First Referee:	Mario Fernando Montenegro Campos
Second Referee:	Flavio Luis Cardeal Padua
Third Referee:	Renato José Martins
Abstract:	A extração de características é uma etapa fundamental em sistemas de Visão Computacional, na qual os dados contidos nos pixels são transformados em feature vector robustos a variações de iluminação, rotação e escala. Durante muitos anos algoritmos baseados em histogramas de orientação de gradientes, como SIFT, HOG e SURF, foram os mais eficientes mas possuem custo computacional elevado. Para contornar este problema alguns algoritmos que geram vetores binários foram criados, como BRIEF, ORB, BRISK e FREAK. Cada um deles propõe uma distribuição espacial diferente de testes binários e o principal objetivo é manter um custo computacional baixo. Este trabalho investiga o problema de seleção de testes binários e apresenta algumas características que limitam a busca de soluções através de Redes Neurais Convolucionais (CNN). Experimentos realizados com uma Rede Siamesa mostram a presença de mínimos locais e um outro problema definido como Componentes Incorretas do Gradiente.
Abstract:	Feature extraction is a fundamental step in Computer Vision systems, which consists in transforming the raw pixels data into new representations robust to illumination, rotation and scale variations. These new representations are known as feature vectors and can be generated by different methodologies depending on the application requirements. For many years algorithms based on gradient orientation histograms, such as SIFT, HOG and SURF, were the most efficient for low level feature extraction but they have a high computational cost due mainly to the use of convolutions. To avoid this problem some algorithms have been created to generate binary vectors, such as BRIEF, ORB, BRISK and FREAK. They propose different spatial distributions of binary tests and the main goal is to keep a low cost to be executed on small and large computers. Recent advances in feature extraction have been obtained using deep learning and the results of Convolutional Neural Networks (CNN) outperformed the handcrafted descriptors. Different CNN architectures were created by researchers and some of them are projected to generate binary vectors, such as LIFT and DeepBit. Their strategies are focused in binarizing the Real values of output layer, which maintains a high computational cost. This work investigates the problem of selecting binary tests in order to discover different spatial distributions that allows the creation of a new binary descriptor. Some limiting properties for searching solutions with CNNs are presented, which appears when the gradient is calculated based on the local pixels neighborhood. Experiments were conducted using a Siamese Network trained with corresponding and non-corresponding patch pairs, whose results show the existence of local minimum and another problem defined in this dissertation as Incorrect Components of Gradient.
Subject:	Visão por computador Redes neurais (Computação) Computação
language:	Português
Publisher:	Universidade Federal de Minas Gerais
Publisher Initials:	UFMG
Rights:	Acesso Aberto
URI:	http://hdl.handle.net/1843/ESBF-B8VGJG
Issue Date:	10-Aug-2018
Appears in Collections:	Dissertações de Mestrado

Files in This Item:

File	Size	Format
bernardojankogoncalvesbiesseck.pdf	20.32 MB	Adobe PDF	View/Open

Show full item record