CUDA-based parallelization of power iteration clustering for large datasets

Gustavo Rodrigues Lacerda Silva; Rafael Ribeiro de Medeiros; Brayan Rene Acevedo Jaimes; Carla Caldeira Takahashi; Douglas Alexandre Gomes Vieira; Antônio de Pádua Braga

doi:https://doi.org/10.1109/ACCESS.2017.2765380

CUDA-based parallelization of power iteration clustering for large datasets

dc.creator	Gustavo Rodrigues Lacerda Silva
dc.creator	Rafael Ribeiro de Medeiros
dc.creator	Brayan Rene Acevedo Jaimes
dc.creator	Carla Caldeira Takahashi
dc.creator	Douglas Alexandre Gomes Vieira
dc.creator	Antônio de Pádua Braga
dc.date.accessioned	2025-04-04T13:43:35Z
dc.date.accessioned	2025-09-09T00:16:28Z
dc.date.available	2025-04-04T13:43:35Z
dc.date.issued	2017
dc.identifier.doi	https://doi.org/10.1109/ACCESS.2017.2765380
dc.identifier.issn	2169-3536
dc.identifier.uri	https://hdl.handle.net/1843/81294
dc.language	eng
dc.publisher	Universidade Federal de Minas Gerais
dc.relation.ispartof	IEEE Access
dc.rights	Acesso Aberto
dc.subject	Otimização matemática
dc.subject	Banco de dados
dc.subject.other	Graphics processing units , Clustering algorithms , Kernel , Eigenvalues and eigenfunctions , Clustering methods , Instruction sets , Symmetric matrices
dc.subject.other	Scalable machine learning algorithms , GPU , power iteration clustering
dc.subject.other	Large Datasets , Parallelization , Clustering Algorithm , Real Applications , Graphics Processing Unit , Clustering Quality , Good Scalability , Clustering Method , Image Segmentation , Massive Data , Row Vector , Graphical User Interface , Intel Xeon , Aerial Images , GB Memory , Spectral Method , Order Of Complexity , Spectral Clustering , Affinity Matrix , Code Version , Graphics Processing Unit Memory , Shared Memory , Spectral Clustering Method , Parallel Implementation , Dominant Eigenvalue , Hardware Configuration , Set Of Kernels , Projection Matrix
dc.title	CUDA-based parallelization of power iteration clustering for large datasets
dc.type	Artigo de periódico
local.citation.epage	1
local.citation.spage	1
local.citation.volume	5
local.description.resumo	This paper presents a new clustering algorithm, the GPIC, a graphics processing unit (GPU) accelerated algorithm for power iteration clustering (PIC). Our algorithm is based on the original PIC proposal, adapted to take advantage of the GPU architecture, maintaining the algorithm’s original properties. The proposed method was compared against the serial implementation, achieving a considerable speedup in tests with synthetic and real data sets. A significant volume of real data application ( >107 records) was used, and we identified that GPIC implementation has good scalability to handle data sets with millions of data points. Our implementation efforts are directed towards two aspects: to process large data sets in less time and to maintain the same quality of the clusters results generated by the original PIC version.
local.publisher.country	Brasil
local.publisher.department	ENG - DEPARTAMENTO DE ENGENHARIA ELETRÔNICA
local.publisher.initials	UFMG
local.url.externa	https://ieeexplore.ieee.org/document/8078163

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1

Nome:: CUDA-based parallelization of power iteration clustering for large datasets.pdf
Tamanho:: 3.59 MB
Formato:: Adobe Portable Document Format

Baixar

Licença do pacote

Agora exibindo 1 - 1 de 1

Nome:: License.txt
Tamanho:: 1.99 KB
Formato:: Plain Text
Descrição:

Baixar

Coleções

Artigo de Periódico