Sparse MTTKRP Acceleration for Tensor Decomposition on GPU

Wijeratne, Sasindu; Kannan, Rajgopal; Prasanna, Viktor

doi:10.1145/3649153.3649187

(Submitted on 14 May 2024)

Abstract: Sparse Matricized Tensor Times Khatri-Rao Product (spMTTKRP) is the bottleneck kernel of sparse tensor decomposition. In this work, we propose a GPU-based algorithm design that addresses the key challenges in accelerating spMTTKRP computation: (1) eliminating global atomic operations across GPU thread blocks, (2) avoiding the transfer of intermediate values between GPU thread blocks and GPU global memory, and (3) ensuring a balanced distribution of workloads across GPU thread blocks. Our approach also supports dynamic tensor remapping, which enables these optimizations in every mode of the input tensor. Across widely used datasets, it achieves geometric mean speedups of 1.5x, 2.0x, and 21.7x in total execution time over the state-of-the-art GPU implementations it is compared against. Ours is the only GPU implementation that supports tensors with more than 4 modes, since the state-of-the-art works have implementation constraints for tensors with a large number of modes.
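
For readers unfamiliar with the kernel, the following is a minimal sequential sketch of what spMTTKRP computes for a tensor stored in coordinate (COO) form. It is not the GPU algorithm proposed in the paper; the function name `sp_mttkrp`, the COO layout, and the toy data are assumptions made purely for illustration. The comment in the accumulation step marks where a parallel implementation would face the write conflicts that the abstract's point (1) refers to.

```python
def sp_mttkrp(coords, vals, factors, mode, rank):
    """Reference spMTTKRP along `mode` for a COO tensor (illustrative only).

    coords  : list of index tuples, one per nonzero
    vals    : list of nonzero values, same order as coords
    factors : one dense factor matrix per mode, each a list of rows of length `rank`
    """
    out = [[0.0] * rank for _ in range(len(factors[mode]))]
    for idx, v in zip(coords, vals):
        # Element-wise product of the matching rows of every *other* factor matrix,
        # scaled by the nonzero value (one row of the Khatri-Rao product).
        row = [v] * rank
        for m, i in enumerate(idx):
            if m == mode:
                continue
            f_row = factors[m][i]
            for r in range(rank):
                row[r] *= f_row[r]
        # Accumulate into the output row selected by the index in `mode`.
        # Nonzeros sharing this index write to the same row; in a parallel
        # setting this is where conflicting updates would have to be resolved.
        target = out[idx[mode]]
        for r in range(rank):
            target[r] += row[r]
    return out


# Toy 2 x 2 x 2 tensor with three nonzeros and rank-2 factor matrices (made-up data).
coords = [(0, 0, 0), (0, 1, 1), (1, 0, 1)]
vals = [1.0, 2.0, 3.0]
A = [[1.0, 0.5], [2.0, 1.0]]
B = [[1.0, 2.0], [3.0, 4.0]]
C = [[5.0, 6.0], [7.0, 8.0]]

mode0 = sp_mttkrp(coords, vals, [A, B, C], mode=0, rank=2)
mode1 = sp_mttkrp(coords, vals, [A, B, C], mode=1, rank=2)
print(mode0)  # [[47.0, 76.0], [21.0, 48.0]]
print(mode1)  # [[47.0, 27.0], [14.0, 8.0]]
```

The same nonzeros are traversed for every mode, but the output row each one updates changes with the mode. Under this reading, an ordering of nonzeros that groups updates well for one mode need not do so for another, which is the kind of situation the abstract's per-mode remapping is meant to address.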

Comments:	In 21st ACM International Conference on Computing Frontiers (CF '24), May 7-9, 2024, Ischia, Italy
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Hardware Architecture (cs.AR)
DOI:	10.1145/3649153.3649187
Cite as:	arXiv:2405.08470 [cs.DC]
	(or arXiv:2405.08470v1 [cs.DC] for this version)

Submission history

From: Sasindu Wijeratne
[v1] Tue, 14 May 2024 09:51:27 GMT (5158kb,D)
