What Is Q-Learning (back To Basics)

#qlearning #qstar #rlhf

What is Q-Learning and how does it work? A brief tour through the background of Q-Learning, Markov Decision Processes, Deep Q-Networks, and other basics necessary to understand Q* 😉

OUTLINE:
0:00 – Introduction
2:00 – Reinforcement Learning
7:00 – Q-Functions
19:00 – The Bellman Equation
26:00 – How to learn the Q-Function?
38:00 – Deep Q-Learning
42:30 – Summary

Paper:
My old video on DQN:

Links:
Homepage:
Merch:
YouTube:
Twitter:
Discord:
LinkedIn:

If you want to support me, the best thing to do is to share out the content 🙂

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar:
Patreon:
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

source

What's Hot

OpenAI’s upcoming AI features won’t be free, reveals Sam Altman

Who Are the Top 21 Artificial Intelligence (AI) Software Companies in 2025?

VC-Backed Lex Generalis Launches, Rejects Hourly Model – Artificial Lawyer

What is Q-Learning (back to basics)

AGI is not coming!

Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)

Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review)

Hidden Portrait May Be Vermeer’s Earliest Known Work

Who Are the Art World Figures on the Time 100 List?

Acquavella Signs Harumi Klossowska de Rola, Daughter of Balthus

Heirs of Jewish Collector Urge Court to Reconsider Claim to Sunflowers

OpenAI’s upcoming AI features won’t be free, reveals Sam Altman

Who Are the Top 21 Artificial Intelligence (AI) Software Companies in 2025?

VC-Backed Lex Generalis Launches, Rejects Hourly Model – Artificial Lawyer

What's Hot

What is Q-Learning (back to basics)

Related Posts

Subscribe to Updates