Comparative genomics and pan-genomic study of genus Corynebacterium
Carregando...
Arquivos
Data
Autor(es)
Título da Revista
ISSN da Revista
Título de Volume
Editor
Universidade Federal de Minas Gerais
Descrição
Tipo
Tese de doutorado
Título alternativo
Primeiro orientador
Membros da banca
Rommel Thiago Juca Ramos
Arthur Gruber
Gabriel da Rocha Fernandes
Jose Miguel Ortega
Arthur Gruber
Gabriel da Rocha Fernandes
Jose Miguel Ortega
Resumo
The genomic area has distinguished itself between different areas of modern biology, mainly after the advent of new high-throughput sequencing technologies (Next-Generation Sequencing - NGS), which allowed the massive deposit of whole genomes in public databases. The deluge of sequenced data, then, required the development of new bioinformatics software, with the ability to analyze these data, culminating in the creation of new areas such as comparative genomics that involves the comparison of the genetic content of an organism against another and helping the prediction of gene function and coding region sequences, identification of evolutionary events and finding of phylogenetic relationships. In this scenario, pan-genome studies allow the comparison of a large number of related bacteria enabling infer correlated genes with a lifestyle, gene repertoires and minimal genome size. In this work, first we describe the genome sequence, assembly and annotation of the Corynebacterium urealyticum strain DSM7111 followed by the comparison of two C. urealyticum, an opportunistic pathogen normally isolated from skin and mucous membranes in humans. Altogether, the strains encode 2,115 non-redundant coding sequences, of which 1,823 are commonly shared by both. Additionally, we applied the Vaxign software and we found 19 putative antigenic proteins through the application of the reverse vaccinology approach. Moreover, we expanded the comparative genomic analyses to the genus Corynebacterium, using 44 genomes of 25 different species. The pan-genomic analyses revealed an open pan-genome, which is currently composed of 22,177 proteincoding genes; the predicted core genome is composed of 562 genes; and, the predicted singletons account for 8,762 genes. Phylogenomic analysis using different strategies revealed the occurrence of four different clusters in the taxonomic tree: one composed of pathogenic, one of non-pathogenic and two of opportunistic pathogenic species. Finally, we presented genomes of C. ulcerans, which will be importante for future pan-genomic studies of this species.
Abstract
Assunto
Genética
Palavras-chave
Genética