Use este identificador para citar o ir al link de este elemento:
http://hdl.handle.net/1843/65891
Tipo: | Artigo de Evento |
Título: | Building the first English-Brazilian Portuguese: corpus for automatic post-editing |
Autor(es): | Felipe de Almeida Costa Thiago Castro Ferreira Adriana Silvina Pagano Wagner Meira Junior |
Resumen: | This paper introduces the first corpus for Automatic Post-Editing of English and a low-resource language, Brazilian Portuguese. The source English texts were extracted from the WebNLG corpus and automatically translated into Portuguese using a state-of-the-art industrial neural machine translator. Post-edits were then obtained in an experiment with native speakers of Brazilian Portuguese. To assess the quality of the corpus, we performed error analysis and computed complexity indicators measuring how difficult the APE task would be. We report preliminary results of Phrase-Based and Neural Machine Translation Models on this new corpus. Data and code publicly available in our repository. |
Asunto: | Linguística de corpus Tradução mecânica |
Idioma: | eng |
País: | Brasil |
Editor: | Universidade Federal de Minas Gerais |
Sigla da Institución: | UFMG |
Departamento: | FALE - FACULDADE DE LETRAS |
Tipo de acceso: | Acesso Aberto |
Identificador DOI: | https://doi.org/10.18653/v1/2020.coling-main.533 |
URI: | http://hdl.handle.net/1843/65891 |
Fecha del documento: | dic-2020 |
metadata.dc.relation.ispartof: | International Conference on Computational Linguistics |
Aparece en las colecciones: | Artigo de Evento |
archivos asociados a este elemento:
archivo | Descripción | Tamaño | Formato | |
---|---|---|---|---|
Building the first English-Brazilian Portuguese corpus for automatic post-editing.pdf | 161.91 kB | Adobe PDF | Visualizar/Abrir |
Los elementos en el repositorio están protegidos por copyright, con todos los derechos reservados, salvo cuando es indicado lo contrario.