Gismo for Ray: A Multi-Node Shared Memory Object Store That Accelerates Ray Workloads

Описание к видео Gismo for Ray: A Multi-Node Shared Memory Object Store That Accelerates Ray Workloads

Ray is a powerful distributed computing framework. However, as data sets grow and computation requirements become more complex, managing memory usage across multiple computing nodes becomes increasingly challenging. Issues that slow down performance include the data copying between the computing nodes, data spilling out of memory into storage, and the data skew among computing nodes. We'll introduce Gismo, a multi-node shared memory object store based on Compute Express Link (CXL) technology to address this challenge. With the help of Gismo, Ray does not need to serialize and transfer the extra copies of the objects across network. Data spill and skew issues will be minimized because each host has direct memory access to the whole object store. This talk will demonstrate how Gismo is integrated with Ray and showcase how it improves overall performance and reduces memory overhead for Ray users.

Find the slide deck here: https://drive.google.com/file/d/1Be3_...


About Anyscale
---
Anyscale is the AI Application Platform for developing, running, and scaling AI.

https://www.anyscale.com/

If you're interested in a managed Ray service, check out:
https://www.anyscale.com/signup/

About Ray
---
Ray is the most popular open source framework for scaling and productionizing AI workloads. From Generative AI and LLMs to computer vision, Ray powers the world’s most ambitious AI workloads.
https://docs.ray.io/en/latest/


#llm #machinelearning #ray #deeplearning #distributedsystems #python #genai

Комментарии

Информация по комментариям в разработке