Using Gemini Pro Vision for multimodal use cases with text, images, and videos

Video description

What are the applications of multimodality with Gemini? This session covers a variety of multimodal use cases for text, images, and video, and offers ideas for applying multimodality to practical business scenarios. You'll also gain experience with Gemini Pro Vision.

Try Gemini in Vertex AI → https://goo.gle/3Vttolh

To complete this workshop, you will need a laptop and a Google Cloud Project.

Walk through an interactive notebook with multimodal use cases with Gemini → https://goo.gle/4b98tbY
Learn about multimodal prompts in the Gemini documentation → https://goo.gle/4aNzaTV
Try out multimodal capabilities in Gemini Pro Vision to create a retail recommendation system → https://goo.gle/49PRc6I

NOTE: Cloud Credits discussed in this session or workshop were for live audiences only

Speakers: Lavi Nigam, Katie Nguyen

Watch more:
Check out all the AI videos at Google I/O 2024 → https://goo.gle/io24-ai-yt

Subscribe to Google Developers → https://goo.gle/developers

#GoogleIO

Products Mentioned: Gemini
Event: Google I/O 2024
