[PLDI24] Compilation of Modular and General Sparse Workspaces

Описание к видео [PLDI24] Compilation of Modular and General Sparse Workspaces

Compilation of Modular and General Sparse Workspaces (Video, PLDI 2024)
Genghan Zhang, Olivia Hsu, and Fredrik Kjolstad
(Stanford University, USA; Stanford University, USA; Stanford University, USA)

Abstract: Recent years have seen considerable work on compiling sparse tensor algebra expressions. This paper addresses a shortcoming in that work, namely how to generate efficient code (in time and space) that scatters values into a sparse result tensor. We address this shortcoming through a compiler design that generates code that uses sparse intermediate tensors (sparse workspaces) as efficient adapters between compute code that scatters and result tensors that do not support random insertion. Our compiler automatically detects sparse scattering behavior in tensor expressions and inserts necessary intermediate workspace tensors. We present an algorithm template for workspace insertion that is the backbone of our code generation algorithm. Our algorithm template is modular by design, supporting sparse workspaces that span multiple user-defined implementations. Our evaluation shows that sparse workspaces can be up to 27.12× faster than the dense workspaces of prior work. On the other hand, dense workspaces can be up to 7.58× faster than the sparse workspaces generated by our compiler in other situations, which motivates our compiler design that supports both. Our compiler produces sequential code that is competitive with hand-optimized linear and tensor algebra libraries on the expressions they support, but that generalizes to any other expression. Sparse workspaces are also more memory efficient than dense workspaces as they compress away zeros. This compression can asymptotically decrease memory usage, enabling tensor computations on data that would otherwise run out of memory.

Article: https://doi.org/10.1145/3656426

ORCID: https://orcid.org/0000-0002-3866-8167, https://orcid.org/0000-0002-4195-8106, https://orcid.org/0000-0002-2267-903X

Video Tags: sparse tensor algebra, compilation, sparse workspaces, code composition, pldi24main-p300-p, doi:10.1145/3656426, orcid:0000-0002-3866-8167, orcid:0000-0002-4195-8106, orcid:0000-0002-2267-903X

Presentation at the PLDI 2024 conference, June 24–28, 2024, https://pldi24.sigplan.org/
Sponsored by ACM SIGPLAN,

Комментарии

Информация по комментариям в разработке