Browsing: Google DeepMind
Research Scientist Hado van Hasselt covers prediction algorithms for policy improvement, leading to algorithms that can learn good behaviour policies…
Research Scientist Hado van Hasselt takes a closer look at model-free prediction and its relation to Monte Carlo and temporal…
Research Scientist Diana Borsa explores dynamic programming algorithms as contraction mappings, looking at when and how they converge to the…
Research Scientist Diana Borsa explains how to solve MDPs with dynamic programming to extract accurate predictions and good control policies.…
Research Scientist Hado van Hasselt looks at why it’s important for learning agents to balance exploring and exploiting acquired knowledge…
Research Scientist Hado van Hasselt introduces the reinforcement learning course and explains how reinforcement learning relates to AI.
Hannah explores the potential of language models, the questions they raise, and whether teaching a computer about language is enough…
In late 2020, DeepMind’s AI system, AlphaFold, solved a 50-year-old grand challenge in biology, known as the protein-folding problem. A…
Cooperation is at the heart of our society. Inventing the railway, giving birth to the Renaissance, and creating the Covid-19…
Do you need a body to have intelligence? And can one exist without the other? Hannah takes listeners behind the…