From PoC to Production: Deploying Gen AI workloads on AWS Inferentia | AWS Infrastructure Day 2024

Описание к видео From PoC to Production: Deploying Gen AI workloads on AWS Inferentia | AWS Infrastructure Day 2024

Are you conducting a proof of concept (PoC) for a generative AI project? Have you faced challenges with moving from PoC to production? In this session Media Monks outlines how they got from PoC to production quickly when creating a generative AI application for a media campaign. They cover their decision-making process in selecting Inferentia 2 from the many infrastructure options, the key performance indicators they tracked, architectural considerations, and the deployment process.

Learn more about Inferentia https://awsonair.net/3zpuOV7 and Trainium https://awsonair.net/3RGx5BV

Follow AWS OnAir:
LinkedIn: https://bit.ly/AWSOnAir-LinkedIn
Twitch: https://bit.ly/Twitch-AWS-OnAir

ABOUT AWS
Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world’s most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

#AWSEvents #GenerativeAI #LLM #FoundationModel #PoC #production #amazonec2 #ec2 #inferentia #trainium #amgrobelny #jasminekyles

Комментарии

Информация по комментариям в разработке