🤖 AI Summary
Researchers from Stanford University and Google have presented a new approach to optimizing sparse tensor computations in deep learning frameworks. The method, implemented in Scorch, a prototype compiler for PyTorch, introduces three key algorithms: a heuristic-based loop ordering algorithm that reduces compilation time from hours to milliseconds, a tiling algorithm specialized for mixed sparse-dense computations, and a format inference algorithm that automatically selects suitable sparse tensor formats. Together these techniques improve performance while preserving the interactive development workflow that machine learning research depends on.
This work matters to the AI and machine learning community because the growing size of deep learning models demands more efficient computation, and sparsity is one of the main levers available. The Scorch compiler demonstrates end-to-end speedups of 1.05-5.80x over existing solutions such as PyTorch Sparse, particularly on workloads like graph neural networks and sparse transformers. By making sparse computations faster and easier to use, the work both paves the way for larger, more complex models and makes sparse operations more practical and accessible in everyday deep learning workflows.
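To make the format-selection problem concrete, here is a minimal sketch, in pure Python with no PyTorch dependency, of the CSR (Compressed Sparse Row) layout, one of the standard sparse formats a format-inference pass like Scorch's must choose among, together with a sparse matrix-vector product that touches only the stored nonzeros. The function names are illustrative, not taken from Scorch or PyTorch.

```python
def dense_to_csr(matrix):
    """Convert a dense row-major matrix (list of lists) to CSR arrays."""
    values, col_indices, row_ptr = [], [], [0]
    for row in matrix:
        for j, v in enumerate(row):
            if v != 0:               # store only the nonzero entries
                values.append(v)
                col_indices.append(j)
        row_ptr.append(len(values))  # cumulative nonzero count per row
    return values, col_indices, row_ptr

def csr_spmv(values, col_indices, row_ptr, x):
    """Compute y = A @ x, iterating only over stored nonzeros."""
    y = []
    for i in range(len(row_ptr) - 1):
        acc = 0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_indices[k]]
        y.append(acc)
    return y

A = [[1, 0, 2],
     [0, 0, 3],
     [4, 5, 0]]
vals, cols, ptr = dense_to_csr(A)
print(csr_spmv(vals, cols, ptr, [1, 1, 1]))  # → [3, 3, 9]
```

The inner loop does work proportional to the number of nonzeros rather than the full matrix size, which is the kind of asymptotic saving a sparse compiler exploits; choosing CSR versus COO or other layouts per tensor, automatically, is what the format inference algorithm described above is responsible for.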