John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI

Описание к видео John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI

John Schulman on how posttraining tames the shoggoth, and the nature of the progress to come...

Timestamps:

00:00:00 Pre-training, post-training, and future capabilities
00:17:21 Plan for AGI 2025
00:29:43 Teaching models to reason
00:41:14 The Road to ChatGPT
00:52:37 What makes for a good RL researcher?
01:01:22 Keeping humans in the loop
01:15:39 State of research, plateaus, and moats

Links:

Apple Podcasts: https://podcasts.apple.com/us/podcast...
Spotify: https://open.spotify.com/episode/1ivz...
Transcript: https://www.dwarkeshpatel.com/p/john-...

Me on Twitter:   / dwarkesh_sp  

Sponsors:

If you’re interested in advertising on the podcast, fill out this form: https://airtable.com/appxGOvFLDLP5dlz...

Your DNA shapes everything about you. Want to know how? Take 10% off our Premium DNA kit with code DWARKESH at https://mynucleus.com/

CommandBar is an AI user assistant that any software product can embed to non-annoyingly assist, support, and unleash their users. Used by forward-thinking CX, product, growth, and marketing teams. Learn more at https://www.commandbar.com/

Комментарии

Информация по комментариям в разработке