Florence-2 : Advancing a Unified Representation for a Variety of Vision Tasks | Paper Explained

Описание к видео Florence-2 : Advancing a Unified Representation for a Variety of Vision Tasks | Paper Explained

Florence-2, a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

GitHub: https://github.com/AarohiSingla/Flore...


Try out the Florence-2 model here: https://huggingface.co/spaces/gokaygo...

Paper: https://arxiv.org/pdf/2311.06242

Florence-2 is pre-trained on our FLD-5B dataset encompassing a total of 5.4B comprehensive annotations across 126M images.

#computervision #largelanguagemodels #languagemodels #microsoft #ai #artificialintelligence

Комментарии

Информация по комментариям в разработке