Please use this identifier to cite or link to this item: http://hdl.handle.net/1843/83345
Type: Artigo de Evento
Title: Anotação de textos não canônicos: um estudo exploratorio de Grande sertão: veredas pelas dependências universais
Authors: André Coneglian
Ana Luisa A. R. Guimarães
Thiago Castro Ferreira
Adriana Silvina Pagano
Abstract: This paper reports on an exploratory study of a sample of 175 sentences retrieved from the renowned Brazilian novel Grande Sertão: Veredas [Portuguese for Great Backlands: Paths; English translation: The devil to pay in the backlands], which were annotated for POS and syntactic relations following the Universal Dependencies guidelines. The study aimed to explore the feasibility of annotating non-canonical text to create treebanks for Brazilian Portuguese. We computed accuracy and precision of the model in order to verify categories annotated more and less successfully. The results show the model performed slightly better for POS than dependency relations and pointed out categories with higher demand for manual revision as being those related to orality phenomena represented by Guimarães Rosa in his novel. The study shows the potential of annotating noncanonical text to enhance existing models with categories less represented in the treebanks.
Subject: Processamento da linguagem natural (Computação)
Linguística de corpus
language: por
metadata.dc.publisher.country: Brasil
Publisher: Universidade Federal de Minas Gerais
Publisher Initials: UFMG
metadata.dc.publisher.department: FALE - FACULDADE DE LETRAS
Rights: Acesso Aberto
URI: http://hdl.handle.net/1843/83345
Issue Date: 2022
metadata.dc.url.externa: https://aclanthology.org/2022.udfestbr-1.1.pdf
metadata.dc.relation.ispartof: Universal Dependencies Brazilian Festival
Appears in Collections:Artigo de Evento



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.