8-bit Methods for Efficient Deep Learning -- Tim Dettmers (University of Washington)


Title: 8-bit Methods for Efficient Deep Learning

Abstract: Large language models are effective tools for many tasks but are difficult to train and run inference with due to their size. Moving from 32-bit models to 16-bit models yielded considerable efficiency gains that made training and inference of large models easier. Can we train and run inference in 8-bit to make further gains? In this talk, I will show that 8-bit inference and training can be used without degrading performance while improving efficiency. To make 8-bit methods work, it is essential to understand how quantization precision affects model performance and training stability as we scale the model size. I will talk about how these factors change with scale and how we need to adjust 8-bit methods to make them work. In particular, I will speak about 8-bit optimizers for training and Int8 inference for large language models with up to 175B parameters. These methods make training and inference more efficient and make large models more accessible to researchers.
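To make the core idea concrete, here is a minimal sketch (not code from the talk) of absmax int8 quantization in Python/NumPy: a float tensor is scaled so its largest-magnitude value maps to 127, rounded to int8, and later dequantized by dividing the scale back out. The function names are illustrative.

    import numpy as np

    def absmax_quantize(x: np.ndarray) -> tuple[np.ndarray, float]:
        """Quantize a float tensor to int8 using absmax scaling."""
        # Map the largest-magnitude value to 127 so all values fit in [-127, 127].
        # The small epsilon guards against an all-zero input.
        scale = 127.0 / (np.max(np.abs(x)) + 1e-12)
        q = np.round(x * scale).astype(np.int8)
        return q, scale

    def absmax_dequantize(q: np.ndarray, scale: float) -> np.ndarray:
        """Recover an approximate float tensor from its int8 representation."""
        return q.astype(np.float32) / scale

    x = np.random.randn(8).astype(np.float32)
    q, scale = absmax_quantize(x)
    x_hat = absmax_dequantize(q, scale)
    print(np.max(np.abs(x - x_hat)))  # rounding error, at most about 0.5 / scale

The 8-bit optimizers and Int8 inference methods described in the abstract are available in the speaker's open-source bitsandbytes library.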
