Enriching the E2E dataset

dc.creator: Thiago Castro Ferreira
dc.creator: Helena Vaz
dc.creator: Brian Davis
dc.creator: Adriana Silvina Pagano
dc.date.accessioned: 2023-08-04T20:47:34Z
dc.date.accessioned: 2025-09-08T23:29:53Z
dc.date.available: 2023-08-04T20:47:34Z
dc.date.issued: 2021
dc.description.sponsorship: CNPq - Conselho Nacional de Desenvolvimento Científico e Tecnológico
dc.description.sponsorship: FAPEMIG - Fundação de Amparo à Pesquisa do Estado de Minas Gerais
dc.description.sponsorship: CAPES - Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
dc.format.mimetype: pdf
dc.identifier.isbn: 978195408510
dc.identifier.uri: https://hdl.handle.net/1843/57496
dc.language: eng
dc.publisher: Universidade Federal de Minas Gerais
dc.relation.ispartof: International Conference on Natural Language Generation
dc.rights: Open Access
dc.subject: Computer science
dc.subject: Corpus linguistics
dc.subject: Natural language processing (Computer science)
dc.title: Enriching the E2E dataset
dc.type: Conference paper
local.citation.epage: 183
local.citation.issue: 14
local.citation.spage: 177
local.description.resumo: This study introduces an enriched version of the E2E dataset, one of the most popular language resources for data-to-text NLG. We extract intermediate representations for popular pipeline tasks such as discourse ordering, text structuring, lexicalization and referring expression generation, enabling researchers to rapidly develop and evaluate their data-to-text pipeline systems. The intermediate representations are extracted by aligning the non-linguistic and text representations through a process called delexicalization, which consists of replacing the referring expressions to input entities/attributes with placeholders. The enriched dataset is publicly available.
local.identifier.orcid: https://orcid.org/0000-0003-0200-3646
local.identifier.orcid: https://orcid.org/0000-0001-9754-1425
local.identifier.orcid: https://orcid.org/0000-0002-5759-2655
local.identifier.orcid: https://orcid.org/0000-0002-3150-3503
local.publisher.country: Brazil
local.publisher.department: FALE - FACULDADE DE LETRAS
local.publisher.department: ICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃO
local.publisher.initials: UFMG
local.url.externa: https://aclanthology.org/2021.inlg-1.18.pdf
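
The delexicalization step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`parse_mr`, `delexicalize`), the `__ATTR__` placeholder format, and the case-insensitive exact string matching are all assumptions made for illustration, and the example record is invented in the style of an E2E meaning representation.

```python
import re
from typing import Dict, Tuple


def parse_mr(mr: str) -> Dict[str, str]:
    """Parse an E2E-style meaning representation, e.g. 'name[The Eagle], food[French]'."""
    return {
        m.group(1).strip(): m.group(2).strip()
        for m in re.finditer(r"([^,\[\]]+)\[([^\]]*)\]", mr)
    }


def delexicalize(mr: str, text: str) -> Tuple[str, Dict[str, str]]:
    """Replace attribute values found in the text with placeholders.

    Returns the delexicalized template and a placeholder -> value mapping.
    Matching is plain case-insensitive string search (an assumption); real
    data would also need to handle paraphrased or omitted values.
    """
    slots = parse_mr(mr)
    template = text
    mapping: Dict[str, str] = {}
    # Longer values go first so a short value cannot clobber part of a longer one.
    for attr, value in sorted(slots.items(), key=lambda kv: -len(kv[1])):
        if not value:
            continue
        placeholder = f"__{attr.upper()}__"  # hypothetical placeholder format
        pattern = re.compile(re.escape(value), flags=re.IGNORECASE)
        if pattern.search(template):
            template = pattern.sub(lambda _: placeholder, template)
            mapping[placeholder] = value
    return template, mapping


if __name__ == "__main__":
    mr = "name[The Eagle], eatType[coffee shop], food[French]"
    text = "The Eagle is a coffee shop serving French food."
    template, mapping = delexicalize(mr, text)
    print(template)  # __NAME__ is a __EATTYPE__ serving __FOOD__ food.
    print(mapping)
```

The resulting template aligns the input attributes with positions in the text, which is the kind of alignment from which intermediate representations such as attribute order or lexicalization templates could be read off.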

Files

Original bundle

Name: Enriching the E2E dataset.pdf
Size: 173.86 KB
Format: Adobe Portable Document Format
