Understanding the search space of methods for automatically designing graph neural networks

Matheus Henrique do Nascimento Nunes

Please use this identifier to cite or link to this item: http://hdl.handle.net/1843/47526

Type:	Dissertação
Title:	Understanding the search space of methods for automatically designing graph neural networks
Other Titles:	Uma análise do espaço de busca de métodos para o design automático de graph neural networks
Authors:	Matheus Henrique do Nascimento Nunes
First Advisor:	Gisele Lobo Pappa
First Referee:	Fabrício Murai Ferreira
Second Referee:	Nuno Lourenço
Abstract:	Graph-structured data has become increasingly available and, due to its ubiquity, an object of study in many areas of research. Due to the absence of the notion of sequence in graphs, Machine Learning (ML) methods have historically struggled to work on this data. Specialized methods for performing ML over graph data have gained a lot of attention from the research community, especially Graph Neural Networks (GNNs), which have been extensively used over real-world data, achieving state-of-the-art results in tasks such as circuit design, movie recommendation, and anomaly detection. Many GNN models have been recently proposed, and choosing the best model for each problem has become a cumbersome and error-prone task. Aiming at mitigating this problem, recent works have proposed strategies for applying Neural Architecture Search (NAS) - a set of methods designed to automatically configure neural networks, very successful on Convolutional Neural Networks, that deal with image data - to GNN models. Automatically configured GNNs have achieved high performance results, surpassing human-crafted ones. However, the NAS for GNNs literature is still in its early stages, and methods that have been successfully applied for NAS in CNNs have yet to be tested on GNNs as well. In this work we have conducted a comprehensive comparative analysis of a proposed Evolutionary Algorithm against a literature Reinforcement Learning and a simple Random Search baseline, considering 7 real-world datasets and two search spaces. We have shown that Random Search is just as effective in finding good performing architectures as other more complex methods. Another interesting finding is that all three search methods converge very early in the search (in about 10% of the budget). We hypothesized that this might have been happening due to the presence of Neutrality (regions in which all solutions have very similar performance values) in the search space. Shifting the focus from the first part of this work, in the second part we have conducted an extensive visual and analytical evaluation of one of the literature's search spaces, using dimensionality reduction and Fitness Landscape Analysis techniques. We have demonstrated that the search space for NAS in GNNs presents high searchability (i.e. it is not difficult for algorithms to explore and find a suitable solution) and neutrality (i.e. there are many regions in the search space in which the performance of the neighboring solutions are relatively equal). We hypothesize that in the future, less expensive methods can be used to perform the optimization in this scenario without loss of generality.
Abstract:	Dados estruturados em formato de grafos têm se tornado cada vez mais disponíveis, e devido à sua ubiquidade, têm se tornado objeto de estudo em várias áreas de pesquisa. Dada a ausência da noção de sequência entre elementos em um grafo, algoritmos de Aprendizado de Máquina (ML, em inglês) têm historicamente enfrentado dificuldades em trabalhar com este tipo de dados. Métodos especializados para grafos têm ganhado atenção da comunidade de pesquisa recentemente, especialmente as Redes Neurais para Grafos (GNNs, em inglês), que têm sido extensivamente utilizadas em dados reais, alcançando resultados estado-da-arte em tarefas como projeto de circuitos, recomendação de filmes e detecção de anomalias. Uma gama de modelos de GNN foi proposta recentemente, e escolher o melhor modelo para cada tarefa tem se tornado uma tarefa complicada e propensa a erros. Objetivando mitigar este problema, trabalhos recentes têm investigado estratégias para aplicar Busca de Arquitetura Neurais (NAS, em inglês) - um conjunto de métodos projetados para automaticamente configurar redes neurais, que têm obtido muito sucesso em Redes Neurais Convolucionais (CNNs, em inglês), que lidam com imagens - para modelos de GNN. GNNs automaticamente configuradas têm alcançado bons resultados em performance, superando redes configuradas por humanos. Porém, a literatura de NAS para GNNs ainda está em seus estágios iniciais, e métodos que foram aplicados com sucesso para NAS em CNNs, ainda não foram testados para GNNs. O foco deste trabalho é conduzir uma análise comparativa compreensiva de um Algoritmo Evolucionario proposto, contra um algoritmo de Aprendizado por Reforço da literatura, e uma Busca Aleatória como baseline, considerando 7 datasets reais, e dois espaços de busca. É demonstrado que a Busca Aleatória é tão efetiva quanto outros métodos mais complexos, em encontrar boas arquiteturas de GNN. Outro achado interessante é de que todos os três métodos convergem bem cedo na busca (utilizando aproximadamente 10% da cota). A hipótese é de que isto acontece devido à presença de Neutralidade no espaço (regiões do espaço em que todas as soluções tem valores de performance parecidas). Em uma segunda etapa do trabalho, o foco é em conduzir uma avaliação visual e analítica extensa de um dos espaços de busca da literatura, usando técnicas de redução de dimensionalidade e Fitness Landscape Analysis (FLA). É demonstrado que o espaço de busca para NAS em GNNs apresenta grande "Buscabilidade" (i.e. não é difícil para algoritmos explorar o espaço e encontrar boas soluções) e "Neutralidade" (i.e. existem várias regiões do espaço em que a performance de soluções vizinhas é relativamente igual). A hipótese é de que, no futuro, métodos menos computacionalmente custosos possam ser empregados para esta tarefa sem perda de generalidade.
Subject:	Computação - Teses Redes neurais ( Computação) - Teses Aprendizado de máquina - Teses
language:	eng
metadata.dc.publisher.country:	Brasil
Publisher:	Universidade Federal de Minas Gerais
Publisher Initials:	UFMG
metadata.dc.publisher.department:	ICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃO
metadata.dc.publisher.program:	Programa de Pós-Graduação em Ciência da Computação
Rights:	Acesso Aberto
URI:	http://hdl.handle.net/1843/47526
Issue Date:	7-Dec-2021
Appears in Collections:	Dissertações de Mestrado

Files in This Item:

File	Description	Size	Format
dissertacao_fixed_pdfa.pdf		9.55 MB	Adobe PDF	View/Open

Show full item record