Anotação de textos não canônicos: um estudo exploratorio de Grande sertão: veredas pelas dependências universais

dc.creatorAndré Coneglian
dc.creatorAna Luisa A. R. Guimarães
dc.creatorThiago Castro Ferreira
dc.creatorAdriana Silvina Pagano
dc.date.accessioned2025-07-04T20:29:50Z
dc.date.accessioned2025-09-09T01:24:11Z
dc.date.available2025-07-04T20:29:50Z
dc.date.issued2022
dc.format.mimetypepdf
dc.identifier.urihttps://hdl.handle.net/1843/83345
dc.languagepor
dc.publisherUniversidade Federal de Minas Gerais
dc.relation.ispartofUniversal Dependencies Brazilian Festival
dc.rightsAcesso Aberto
dc.subjectProcessamento da linguagem natural (Computação)
dc.subjectLinguística de corpus
dc.subject.otherUniversal Dependencies
dc.subject.otherNon-canonical text
dc.subject.otherBrazilian Portuguese
dc.titleAnotação de textos não canônicos: um estudo exploratorio de Grande sertão: veredas pelas dependências universais
dc.typeArtigo de evento
local.citation.epage11
local.citation.spage1
local.description.resumoThis paper reports on an exploratory study of a sample of 175 sentences retrieved from the renowned Brazilian novel Grande Sertão: Veredas [Portuguese for Great Backlands: Paths; English translation: The devil to pay in the backlands], which were annotated for POS and syntactic relations following the Universal Dependencies guidelines. The study aimed to explore the feasibility of annotating non-canonical text to create treebanks for Brazilian Portuguese. We computed accuracy and precision of the model in order to verify categories annotated more and less successfully. The results show the model performed slightly better for POS than dependency relations and pointed out categories with higher demand for manual revision as being those related to orality phenomena represented by Guimarães Rosa in his novel. The study shows the potential of annotating noncanonical text to enhance existing models with categories less represented in the treebanks.
local.identifier.orcidhttps://orcid.org/0000-0002-4172-5521
local.identifier.orcidhttps://orcid.org/0000-0001-7278-6856
local.identifier.orcidhttps://orcid.org/0000-0003-0200-3646
local.identifier.orcidhttps://orcid.org/0000-0002-3150-3503
local.publisher.countryBrasil
local.publisher.departmentFALE - FACULDADE DE LETRAS
local.publisher.initialsUFMG
local.url.externahttps://aclanthology.org/2022.udfestbr-1.1.pdf

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
Anotação de textos não canônicos um estudo exploratorio de Grande sertão veredas pelas dependências universais.pdf
Tamanho:
474.55 KB
Formato:
Adobe Portable Document Format

Licença do pacote

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
License.txt
Tamanho:
1.99 KB
Formato:
Plain Text
Descrição: