DOD-ETL: distributed on-demand ETL for near real-time business intelligence

dc.creatorGustavo V. Machado
dc.creatorÍtalo Cunha
dc.creatorAdriano Cesar Machado Pereira
dc.creatorLeonardo B. Oliveira
dc.date.accessioned2024-05-22T22:00:23Z
dc.date.accessioned2025-09-08T23:23:04Z
dc.date.available2024-05-22T22:00:23Z
dc.date.issued2019-11-20
dc.format.mimetypepdf
dc.identifier.doihttps://doi.org/10.1186/s13174-019-0121-z
dc.identifier.issn1867-4828
dc.identifier.urihttps://hdl.handle.net/1843/68565
dc.languageeng
dc.publisherUniversidade Federal de Minas Gerais
dc.relation.ispartofJournal of Internet Services and Applications
dc.rightsAcesso Aberto
dc.subjectCiência da Computação
dc.subjectBig data
dc.subjectBusiness intelligence
dc.subject.otherNear real-time ETL
dc.subject.otherBusiness intelligence
dc.subject.otherBig data
dc.titleDOD-ETL: distributed on-demand ETL for near real-time business intelligence
dc.typeArtigo de periódico
local.citation.epage15
local.citation.issue21
local.citation.spage1
local.citation.volume10
local.description.resumoThe competitive dynamics of the globalized market demand information on the internal and external reality of corporations. Information is a precious asset and is responsible for establishing key advantages to enable companies to maintain their leadership. However, reliable, rich information is no longer the only goal. The time frame to extract information from data determines its usefulness. This work proposes DOD-ETL, a tool that addresses, in an innovative manner, the main bottleneck in Business Intelligence solutions, the Extract Transform Load process (ETL), providing it in near real-time. DOD-ETL achieves this by combining an on-demand data stream pipeline with a distributed, parallel and technology-independent architecture with in-memory caching and efficient data partitioning. We compared DOD-ETL with other Stream Processing frameworks used to perform near real-time ETL and found DOD-ETL executes workloads up to 10 times faster. We have deployed it in a large steelworks as a replacement for its previous ETL solution, enabling near real-time reports previously unavailable.
local.identifier.orcidhttp://orcid.org/0000-0002-6433-171X
local.publisher.countryBrasil
local.publisher.departmentICEX - INSTITUTO DE CIÊNCIAS EXATAS
local.publisher.departmentICX - DEPARTAMENTO DE CIÊNCIA DA COMPUTAÇÃO
local.publisher.initialsUFMG
local.url.externahttps://jisajournal.springeropen.com/articles/10.1186/s13174-019-0121-z

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
DOD-ETL_ distributed on-demand ETL for near real-time business intelligence.pdf
Tamanho:
743.24 KB
Formato:
Adobe Portable Document Format

Licença do pacote

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
License.txt
Tamanho:
1.99 KB
Formato:
Plain Text
Descrição: