Accelerate Your GenAI Model Inference with Ray and Kubernetes - Richard Liu, Google Cloud

Generative AI has become increasingly prevalent in recent years and is reaching a critical point as models demonstrate human-level capabilities. However, serving these massive models has presented new technical challenges: they contain hundreds of billions of parameters and require massive computational resources. In this talk, we will discuss how to serve GenAI models using KubeRay on Kubernetes with hardware accelerators like GPUs and TPUs. Practitioners will learn how to get these large models into production on a performant and cost-effective Kubernetes platform. Ray is an open-source framework for distributed machine learning that enables ML practitioners to scale their workloads out to large clusters of machines. Ray Serve offers a scalable, framework-agnostic library for online inference that is suitable for large and complex models. The audience will learn how integrating Ray with accelerators can create a powerful platform for serving GenAI models.
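To make the KubeRay-on-Kubernetes approach concrete, below is a minimal sketch of what a RayService manifest with a GPU worker group might look like. This is an illustrative fragment, not material from the talk: the names (`genai-inference`, `text_generator`, `app:deployment`), the image tags, and the replica counts are all placeholder assumptions.

```yaml
# Illustrative KubeRay RayService sketch (all names and images are placeholders).
apiVersion: ray.io/v1
kind: RayService
metadata:
  name: genai-inference          # hypothetical service name
spec:
  serveConfigV2: |
    applications:
      - name: text_generator     # hypothetical Ray Serve application
        import_path: app:deployment   # hypothetical module:variable path
  rayClusterConfig:
    headGroupSpec:
      rayStartParams: {}
      template:
        spec:
          containers:
            - name: ray-head
              image: rayproject/ray:2.9.0
    workerGroupSpecs:
      - groupName: gpu-workers
        replicas: 1
        rayStartParams: {}
        template:
          spec:
            containers:
              - name: ray-worker
                image: rayproject/ray:2.9.0-gpu
                resources:
                  limits:
                    nvidia.com/gpu: 1   # schedule one GPU per worker pod
```

With a manifest along these lines, the KubeRay operator provisions the Ray cluster and deploys the Serve application onto it, and Kubernetes handles scheduling the GPU-backed worker pods.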
