o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias

Скачать o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias бесплатно в качестве 4к (2к / 1080p)

У нас вы можете скачать бесплатно o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias или посмотреть видео с ютуба в максимальном доступном качестве.

Для скачивания выберите вариант из формы ниже:

Cкачать музыку o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias бесплатно в формате MP3:

Если иконки загрузки не отобразились, ПОЖАЛУЙСТА, НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если у вас возникли трудности с загрузкой, пожалуйста, свяжитесь с нами по контактам, указанным в нижней части страницы.
Спасибо за использование сервиса video2dn.com

Описание к видео o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias

o3 (Part 2): Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias

o3 is indeed groundbreaking, and shows that we might be close to finding a general training procedure that can self-improve with fine-tuning.

Here's part 2 of my discussion session on how o3 works based on my own understanding of it (or more generally, architectures that bootstrap learning via fine-tuning on correct trajectories)!

While o3 is powerful, I do not think o3-type architecture is the only way ahead for learning.

I believe that fine-tuning on own trajectories is slow to learn, and having a procedure to learn with external memory is very important (and missing) right now!

I also believe that learning from arbitrary start and end states in a trajectory is important - for instance, in math, we do not want to just learn the goal / model answer, but perhaps also how to reach every intermediate step of the solution.

Moreover, we should consider imbuing some biases so that we can reduce samples needed for training - like filters in Convolutional Neural Networks to bias for neighbouring pixels, so we do not need too many translations of the original image to learn translational invariance/equivariance.

~~~

Slides: https://github.com/tanchongmin/agentj...

References:

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters - https://arxiv.org/abs/2408.03314
Learning, Fast and Slow: https://arxiv.org/abs/2301.13758
LLMs as a System of Multiple Expert Agents: https://arxiv.org/pdf/2310.05146

~~~

0:00 Introduction and Recap
2:21 Impressive Benchmark Scores
8:50 Is it AGI?
9:50 Correct Trajectories
35:45 Tree Search vs Parallel Search
57:14 Path Ahead - Adaptive Benchmarks
1:07:33 Learning, Fast and Slow
1:24:33 Multiple Abstraction Spaces / Neurosymbolic Integration
1:31:25 Discussion
1:54:30 Conclusion

~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord:   / discord
LinkedIn:   / chong-min-tan-94652288
Online AI blog: https://delvingintotech.wordpress.com/
Twitter:   / johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Комментарии

Информация по комментариям в разработке

o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias

Скачать o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias бесплатно в качестве 4к (2к / 1080p)

Cкачать музыку o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias бесплатно в формате MP3:

Описание к видео o3 (Part 2) - Tradeoffs of Heuristics, Tree Search, External Memory, In-built Bias

Похожие видео