On the relation of privacy and fairness through the lenses of quantitative information flow

Bruno Demattos Nogueira

Please use this identifier to cite or link to this item: http://hdl.handle.net/1843/64585

Type:	Master's dissertation
Title:	On the relation of privacy and fairness through the lenses of quantitative information flow
Authors:	Bruno Demattos Nogueira
First Advisor:	Mário Sérgio Ferreira Alvim Júnior
First Referee:	Natasha Fernandes
Second Referee:	Catuscia Palamidessi
Abstract:	When developing a machine learning (ML) system, there are two common concerns besides the algorithm's performance. The first is whether the system is fair, that is, whether it treats individuals from different groups similarly, giving them similar classifications. The second is whether the system is private, that is, whether it avoids revealing private information about individuals in the training set when the output is shown to an observer. Initially, the two concerns were studied separately, but recently the connection between them has attracted increasing attention in the ML community. In this work, we present an extension of the quantitative information flow framework that fully describes which situations can arise in terms of fairness and privacy and models the two as duals. We then model four existing fairness notions using our framework. Finally, we describe experiments showing how our model behaves in real-world scenarios, testing it with different datasets and ML algorithms.
Abstract (translated from the Portuguese):	When developing a machine learning system, there are two concerns beyond the algorithm's performance. The first is whether the system is fair, that is, whether it treats individuals from different groups in the same way, classifying them similarly. The second is whether the system is private, that is, whether it avoids revealing private information about individuals in the training set when the output is shown to an observer. Initially, these two concerns were considered independently, but recently the connection between them has attracted growing attention in the machine learning community. In this work, we present an extension of the quantitative information flow framework that completely describes all the situations that can occur in terms of privacy and fairness. In addition, we model these two quantities as duals. We then model four existing fairness metrics using our framework. Finally, we describe experiments that show how our model behaves in scenarios with real data, testing it with different datasets and algorithms.
Subject:	Computing - Theses; Machine learning - Theses; Information theory - Theses; Right to privacy - Theses
Language:	eng
Publisher Country:	Brazil
Publisher:	Universidade Federal de Minas Gerais
Publisher Initials:	UFMG
Publisher Department:	ICX - Departamento de Ciência da Computação
Publisher Program:	Programa de Pós-Graduação em Ciência da Computação
Rights:	Open Access
Rights URI:	http://creativecommons.org/licenses/by/3.0/pt/
URI:	http://hdl.handle.net/1843/64585
Issue Date:	30-Nov-2023
Appears in Collections:	Master's Dissertations (Dissertações de Mestrado)

Files in This Item:

File	Description	Size	Format
Bruno Nogueira - Thesis.pdf		1.59 MB	Adobe PDF

This item is licensed under a Creative Commons License