David Silver: AlphaGo, AlphaZero, And Deep Reinforcement Learning | Lex Fridman Podcast #86

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass:
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store):
– Cash App (Google Play):

EPISODE LINKS:
Reinforcement learning (book):

PODCAST INFO:
Podcast website:

Apple Podcasts:

Spotify:

RSS:

Full episodes playlist:

Clips playlist:

OUTLINE:
0:00 – Introduction
4:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

CONNECT:
– Subscribe to this YouTube channel
– Twitter:
– LinkedIn:
– Facebook:
– Instagram:
– Medium:
– Support on Patreon:

source

What's Hot

Alibaba Cloud Releases the Qwen3-Next Base Model Architecture and Open Sources the 80B-A3B Series_model_this_two

Automatic Memory of Chat Content_has_memory_users’

Indian techie who once worked at IBM Bengaluru left software engineering because…

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

Dave Hone: T-Rex, Dinosaurs, Extinction, Evolution, and Jurassic Park | Lex Fridman Podcast #480

Dave Plummer: Programming, Autism, and Old-School Microsoft Stories | Lex Fridman Podcast #479

Scott Horton: The Case Against War and the Military Industrial Complex | Lex Fridman Podcast #478

Sally Mann Says Her Black Men Photos Are ‘Problematic’ in Hindsight

NeueHouse, a Hot Spot for Art Events, Files for Bankruptcy

Obama Presidential Center Announces Nine New Artist Commissions

Italy Protests Return of Carpaccio Altarpiece to Slovenia

Alibaba Cloud Releases the Qwen3-Next Base Model Architecture and Open Sources the 80B-A3B Series_model_this_two

Automatic Memory of Chat Content_has_memory_users’

Indian techie who once worked at IBM Bengaluru left software engineering because…

What's Hot

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

Related Posts

Subscribe to Updates