Use este identificador para citar o ir al link de este elemento:
http://hdl.handle.net/1843/83345
Registro completo de metadatos
Campo DC | Valor | Idioma |
---|---|---|
dc.creator | André Coneglian | pt_BR |
dc.creator | Ana Luisa A. R. Guimarães | pt_BR |
dc.creator | Thiago Castro Ferreira | pt_BR |
dc.creator | Adriana Silvina Pagano | pt_BR |
dc.date.accessioned | 2025-07-04T20:29:50Z | - |
dc.date.available | 2025-07-04T20:29:50Z | - |
dc.date.issued | 2022 | - |
dc.citation.spage | 1 | pt_BR |
dc.citation.epage | 11 | pt_BR |
dc.identifier.uri | http://hdl.handle.net/1843/83345 | - |
dc.description.resumo | This paper reports on an exploratory study of a sample of 175 sentences retrieved from the renowned Brazilian novel Grande Sertão: Veredas [Portuguese for Great Backlands: Paths; English translation: The devil to pay in the backlands], which were annotated for POS and syntactic relations following the Universal Dependencies guidelines. The study aimed to explore the feasibility of annotating non-canonical text to create treebanks for Brazilian Portuguese. We computed accuracy and precision of the model in order to verify categories annotated more and less successfully. The results show the model performed slightly better for POS than dependency relations and pointed out categories with higher demand for manual revision as being those related to orality phenomena represented by Guimarães Rosa in his novel. The study shows the potential of annotating noncanonical text to enhance existing models with categories less represented in the treebanks. | pt_BR |
dc.format.mimetype | pt_BR | |
dc.language | por | pt_BR |
dc.publisher | Universidade Federal de Minas Gerais | pt_BR |
dc.publisher.country | Brasil | pt_BR |
dc.publisher.department | FALE - FACULDADE DE LETRAS | pt_BR |
dc.publisher.initials | UFMG | pt_BR |
dc.relation.ispartof | Universal Dependencies Brazilian Festival | pt_BR |
dc.rights | Acesso Aberto | pt_BR |
dc.subject | Universal Dependencies | pt_BR |
dc.subject | Non-canonical text | pt_BR |
dc.subject | Brazilian Portuguese | pt_BR |
dc.subject.other | Processamento da linguagem natural (Computação) | pt_BR |
dc.subject.other | Linguística de corpus | pt_BR |
dc.title | Anotação de textos não canônicos: um estudo exploratorio de Grande sertão: veredas pelas dependências universais | pt_BR |
dc.type | Artigo de Evento | pt_BR |
dc.url.externa | https://aclanthology.org/2022.udfestbr-1.1.pdf | pt_BR |
dc.identifier.orcid | https://orcid.org/0000-0002-4172-5521 | pt_BR |
dc.identifier.orcid | https://orcid.org/0000-0001-7278-6856 | pt_BR |
dc.identifier.orcid | https://orcid.org/0000-0003-0200-3646 | pt_BR |
dc.identifier.orcid | https://orcid.org/0000-0002-3150-3503 | pt_BR |
Aparece en las colecciones: | Artigo de Evento |
archivos asociados a este elemento:
archivo | Descripción | Tamaño | Formato | |
---|---|---|---|---|
Anotação de textos não canônicos um estudo exploratorio de Grande sertão veredas pelas dependências universais.pdf | 474.55 kB | Adobe PDF | Visualizar/Abrir |
Los elementos en el repositorio están protegidos por copyright, con todos los derechos reservados, salvo cuando es indicado lo contrario.