o1 - What is Going On? Why o1 is a 3rd Paradigm of Model + 10 Things You Might Not Know

Описание к видео o1 - What is Going On? Why o1 is a 3rd Paradigm of Model + 10 Things You Might Not Know

o1 is different, and even sceptics are calling it a 'large reasoning model'. But why is it so different, and why does that say about the future? When models are rewarded for correctness of answers, not just harmlessness or predicting the next word. But does even o1 lack spatial reasoning? How did the White House react yesterday? And did Ilya Sutskever warn of o1 getting ... 'creative'?

AI Insiders - Now $9:   / aiexplained  

Chapters:
00:00 – Intro
01:04 – How o1 Works (The 3rd Paradigm)
03:10 – We Don’t Need Human Examples (OpenAI)
03:54 – How o1 Works (Temp 1 Graded)
06:28 – Is This Reasoning?
08:48 – Personal Announcement
11:27 – Hidden, serial Thoughts?
13:11 – Memorized Reasoning?
15:40 – 10 Facts


o1, Learning to Reason - https://openai.com/index/learning-to-...

Species Tweet: https://x.com/nickcammarata/status/18...

Noam Brown Video:    • Parables on the Power of Planning in ...  
https://x.com/polynoamial/status/1834...

2021 Paper on Verifiers: https://arxiv.org/pdf/2110.14168

Let’s Verify Step By Step: https://arxiv.org/pdf/2305.20050

DeepMind Not Far Behind: https://arxiv.org/pdf/2211.14275

Chain of Thought for Serial Problems: https://arxiv.org/pdf/2402.12875

Q* Clues (yes, I am proud of that one):    • Q* - Clues to the Puzzle?  

Let’s Think Pixel by Pixel: https://x.com/geoffreyirving/status/1...

Or Dot by Dot (the general power of CoT): https://arxiv.org/pdf/2404.15758

RL by Karpathy: https://x.com/karpathy/status/1835561...

Not Prompt Engineering: https://x.com/sytelus/status/18354333...

ARC-AGI Analysis: https://arcprize.org/blog/openai-o1-r...

Reality Foundation models: https://x.com/armandjoulin/status/183...

Memorising Reasoning vs CoT Q-Table: https://x.com/rao2z

When You Know the Right CoT: https://x.com/lukaszkaiser/status/183...

Original Information Report: https://www.theinformation.com/articl...

StockFish: https://en.wikipedia.org/wiki/Stockfi...)

Will AGI fall like chess: https://x.com/polynoamial/status/1835...

Fei Fei Start-up $1B: https://www.ft.com/content/0b210299-4...

White House Report: https://www.whitehouse.gov/briefing-r...

Simple-Bench: https://simple-bench.com/try-yourself...

o1 Fails:   / new_o1_still_fails_miserably_at_trivial_qu...  



My New Coursera Course! The 8 Most Controversial Terms in AI: https://imp.i384100.net/m57g3M

Non-hype Newsletter: https://signaltonoise.beehiiv.com/

GenAI Hourly Consulting: https://www.theinsiders.ai/

I use Descript to edit my videos: https://get.descript.com/ldgxfuj2bhnb

Many people expense AI Insiders for work. Feel free to use this template: https://docs.google.com/document/d/1l...

AI Insiders - Now $9:   / aiexplained  

Комментарии

Информация по комментариям в разработке