Automatic WLAN control using reinforcement learning for improved quality of experience

Henrique Duarte Moura

Please use this identifier to cite or link to this item: http://hdl.handle.net/1843/33548

Type:	Tese
Title:	Automatic WLAN control using reinforcement learning for improved quality of experience
Other Titles:	Controle automático de WLAN usando aprendizado por reforço para melhorar a qualidade da experiência
Authors:	Henrique Duarte Moura
First Advisor:	Daniel Fernandes Macedo
First Co-advisor:	Marcos Augusto Menezes Vieira
First Referee:	Adriano Alonso Veloso
Second Referee:	Steven Latré
Third Referee:	Alex Borges Vieira
metadata.dc.contributor.referee4:	José Marcos Silva Nogueira
Abstract:	Real-time performance demands, such as augmented reality, and high quality video streaming brought new challenges to network control, as users expect high levels of quality to be upheld. Improving the user satisfaction and minimizing customer turnover while still maintaining their competitive advantage can be a significant challenge for service providers. Quality of Experience (QoE) is hard to estimate because it is perceived subjectively by users, as a result of the user’s internal state (e.g., predispositions, expectations, needs, motivation, mood), the designed system characteristics (e.g., complexity, purpose, usability, functionality, relevance), and the context within which the service is experienced (e.g., organizational/social settings, meaningfulness of the activity, voluntariness of use). In this thesis, we focus on Wireless LANs (WLANs). WLAN has become commonplace in office, and campus sites, and are the most common method of Internet access for home uses. WLAN are subject to an already crowded radio spectrum, and the communication deteriorates due to the dynamics of the transmission medium, fading multipath channels, interferences, and misconfiguration. An automatic approach to network control should be developed, because: (i) WLAN users are non-technical; (ii) access providers do not perform direct or online control of the devices in the residences, for legal reasons or because of technical difficulties due to the large number of users; (iii) the increasing demand for more performance by the users; and (iv) the ISP’s need to maintain its competitiveness. To achieve that, a machine should learn from the execution of a particular task to maintain a specific performance metric, and based on past experience, improve the performance while executing a task. Reinforcement learning (RL) guides the learning of an agent using a reward-aware approach. It is very useful in networking, because RL algorithms perform decision making under unknown or very-hard-to-model network conditions. Then, learning algorithms can vary network parameters to meet some performance criteria, such as QoE, providing a more user-centric approach to network services. At the same, the Software Defined Networking (SDN) paradigm can be used to further improve the management of computer networks. SDN advocates the separation of the control and data planes, and the use of a (logically) centralized control. Those characteristics allow the network administrators to have a fuller view of the system, and thus perform global optimizations on the network. To achieve auto-configuration and auto-optimization capabilities with minimal or no human management, two prerequisites must be fulfilled: a control loop that gathers information from the environment and acts on the devices, and some QoE metric representing the user perceived satisfaction. In this thesis, we use Multi-Armed Bandit (MAB), Q-Learning (QL), and Deep QL as control loops, varying from a simple to a complex model. We built two prototypes, one for web browsing and one for video traffic.To evaluate the web browsing application, we use a QoE metric from the literature, composed by three regressors (QoE predictors), one for each of three classes of web sites. We proposed a classification algorithm using semi-supervised learning, which correlates the example of the site classes provided by the authors of the predictor. Next, we developed a QoE predictor as well as a Deep QL control loop for video traffic. results show that the control loops improve QoE. In the web browsing scenario, our best result improved the QoE by 167%, if compared to the baselines, and improved the page load times by up to 6.6 times. In the video streaming evaluation the control loop improves the QoE by 91% compared to the baselines.
Subject:	Computação - Teses Aprendizado de máquina. Sistema de comunicação sem fio. Redes de computador
language:	eng
metadata.dc.publisher.country:	Brasil
Publisher:	Universidade Federal de Minas Gerais
Publisher Initials:	UFMG
metadata.dc.publisher.department:	ICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃO
metadata.dc.publisher.program:	Programa de Pós-Graduação em Ciência da Computação
Rights:	Acesso Aberto
metadata.dc.rights.uri:	http://creativecommons.org/licenses/by-nc-nd/3.0/pt/
URI:	http://hdl.handle.net/1843/33548
Issue Date:	20-Dec-2019
Appears in Collections:	Dissertações de Mestrado

Files in This Item:

File	Description	Size	Format
Tese___PPGCC___Henrique_Moura-final.pdf	Thesis	24.95 MB	Adobe PDF	View/Open

Show full item record

This item is licensed under a Creative Commons License