Identifying key developers in software projects using code authorship metrics

dc.creatorGuilherme Amaral Avelino
dc.date.accessioned2019-08-10T21:51:40Z
dc.date.accessioned2025-09-09T00:19:51Z
dc.date.available2019-08-10T21:51:40Z
dc.date.issued2018-06-21
dc.description.abstractCode authorship is a key information in software projects. However, its practical usage in such projects is not widely explored. Therefore, in this thesis, we first define several authorship-centric concepts, which we use to investigate the development teams of 115 open source projects, including an in-depth analysis of the Linux kernel. After, we use code authorship metrics to address two well-known software engineering problems: (1) to assess knowledge concentration in software projects and (2) to identify skilled developers to maintain specific source code files. To address the first problem, we propose an algorithm to estimate truck factors (TF), a concept widely used by practitioners to reveal key project members. We use this algorithm to detect TF events in 1,932 projects and to reveal the practices that help them to overcome such events. Finally, to address the second problem, we investigate the effectiveness of authorship metrics to identify skilled maintainers in 10 projects.
dc.identifier.urihttps://hdl.handle.net/1843/ESBF-B36HQW
dc.languagePortuguês
dc.publisherUniversidade Federal de Minas Gerais
dc.rightsAcesso Aberto
dc.subjectCode autorship
dc.subjectDesenvolvimento
dc.subjectTruck Factor
dc.subjectComputação
dc.subjectEngenharia de software
dc.subjectKey developers
dc.subject.othersoftware maintainers
dc.subject.otherdevelopment team
dc.subject.otherLinux kernel
dc.subject.othertruck factor
dc.subject.otherCode authorship
dc.titleIdentifying key developers in software projects using code authorship metrics
dc.typeTese de doutorado
local.contributor.advisor1Marco Tulio de Oliveira Valente
local.contributor.referee1Eduardo Magno Lages Figueiredo
local.contributor.referee1Mirella Moura Moro
local.contributor.referee1Fernando Jose Castor de Lima Filho
local.description.resumoCode authorship is a key information in software projects. However, its practical usage in such projects is not widely explored. Therefore, in this thesis, we first define several authorship-centric concepts, which we use to investigate the development teams of 115 open source projects, including an in-depth analysis of the Linux kernel. After, we use code authorship metrics to address two well-known software engineering problems: (1) to assess knowledge concentration in software projects and (2) to identify skilled developers to maintain specific source code files. To address the first problem, we propose an algorithm to estimate truck factors (TF), a concept widely used by practitioners to reveal key project members. We use this algorithm to detect TF events in 1,932 projects and to reveal the practices that help them to overcome such events. Finally, to address the second problem, we investigate the effectiveness of authorship metrics to identify skilled maintainers in 10 projects.
local.publisher.initialsUFMG

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
guilhermeamaralavelino.pdf
Tamanho:
1.59 MB
Formato:
Adobe Portable Document Format