Thinking While Moving: Deep Reinforcement Learning With Concurrent Control

Classic RL “stops” the world whenever the Agent computes a new action. This paper considers a more realistic scenario where the agent is thinking about the next action to take while still performing the last action. This results in a fascinating way of reformulating Q-learning in continuous time, then introducing concurrency and finally going back to discrete time.

Abstract:
We study reinforcement learning in settings where sampling an action from the policy must be done concurrently with the time evolution of the controlled system, such as when a robot must decide on the next action while still performing the previous action. Much like a person or an animal, the robot must think and move at the same time, deciding on its next action before the previous one has completed. In order to develop an algorithmic framework for such concurrent control problems, we start with a continuous-time formulation of the Bellman equations, and then discretize them in a way that is aware of system delays. We instantiate this new class of approximate dynamic programming methods via a simple architectural extension to existing value-based deep reinforcement learning algorithms. We evaluate our methods on simulated benchmark tasks and a large-scale robotic grasping task where the robot must “think while moving”.

Authors: Ted Xiao, Eric Jang, Dmitry Kalashnikov, Sergey Levine, Julian Ibarz, Karol Hausman, Alexander Herzog

Links:
YouTube:
Twitter:
BitChute:
Minds:

source

What's Hot

Look To A 10-Year Horizon For The Real Impact Of AI

AI video models could be next big copyright battle | MLex

HR Uses AI To Redefine Strategies

Thinking While Moving: Deep Reinforcement Learning with Concurrent Control

[Paper Analysis] On the Theoretical Limitations of Embedding-Based Retrieval (Warning: Rant)

AGI is not coming!

Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)

Artist Behind Canterbury Cathedral Art Responds to JD Vance, Elon Musk

Jenkins Johnson Gallery to Open Tribeca Outpost on Marian Goodman Gallery’s Third Floor

Ruth Asawa May Have Broken Record at MoMA—and More Art News

Toledo Museum of Art Director on Digital Art, AI, and Future-Proofing

Look To A 10-Year Horizon For The Real Impact Of AI

AI video models could be next big copyright battle | MLex

HR Uses AI To Redefine Strategies

What's Hot

Thinking While Moving: Deep Reinforcement Learning with Concurrent Control

Related Posts

Subscribe to Updates