LeDeepChef 👨‍🍳 Deep Reinforcement Learning Agent For Families Of Text-Based Games

The AI cook is here! This agent learns to play a text-based game where the goal is to prepare a meal according to a recipe. Challenges? Many! The number of possible actions is huge, ingredients change and can include ones never seen before, you need to navigate rooms, use tools, manage an inventory and sequence everything correctly and all of this from a noisy textual description that the game engine throws at you. This paper mixes supervised explicit training with reinforcement learning in order to solve this task.

Abstract:
While Reinforcement Learning (RL) approaches lead to significant achievements in a variety of areas in recent history, natural language tasks remained mostly unaffected, due to the compositional and combinatorial nature that makes them notoriously hard to optimize. With the emerging field of Text-Based Games (TBGs), researchers try to bridge this gap. Inspired by the success of RL algorithms on Atari games, the idea is to develop new methods in a restricted game world and then gradually move to more complex environments. Previous work in the area of TBGs has mainly focused on solving individual games. We, however, consider the task of designing an agent that not just succeeds in a single game, but performs well across a whole family of games, sharing the same theme. In this work, we present our deep RL agent–LeDeepChef–that shows generalization capabilities to never-before-seen games of the same family with different environments and task descriptions. The agent participated in Microsoft Research’s “First TextWorld Problems: A Language and Reinforcement Learning Challenge” and outperformed all but one competitor on the final test set. The games from the challenge all share the same theme, namely cooking in a modern house environment, but differ significantly in the arrangement of the rooms, the presented objects, and the specific goal (recipe to cook). To build an agent that achieves high scores across a whole family of games, we use an actor-critic framework and prune the action-space by using ideas from hierarchical reinforcement learning and a specialized module trained on a recipe database.

Authors: Leonard Adolphs, Thomas Hofmann

source

What's Hot

Tucker Carlson Asks OpenAI CEO Sam Altman If He Ordered Employee’s Murder

The AI drug breakthrough is taking a long time to arrive for reasons that may have little to do with the technology’s limits

Top Free & Paid AI Alternatives to Try in 2025 News24 –

LeDeepChef 👨‍🍳 Deep Reinforcement Learning Agent for Families of Text-Based Games

AGI is not coming!

Context Rot: How Increasing Input Tokens Impacts LLM Performance (Paper Analysis)

Energy-Based Transformers are Scalable Learners and Thinkers (Paper Review)

Ohio Auction of Two Paintings Looted By Nazis Halted By Foundation

Lee Ufan Painting at Center of Bribery Investigation in Korea

Drought Reveals 40 Ancient Tombs in Northern Iraqi Reservoir

Artifacts Removed from Gaza Building Before Suspected Israeli Strike

Tucker Carlson Asks OpenAI CEO Sam Altman If He Ordered Employee’s Murder

The AI drug breakthrough is taking a long time to arrive for reasons that may have little to do with the technology’s limits

Top Free & Paid AI Alternatives to Try in 2025 News24 –

What's Hot

LeDeepChef 👨‍🍳 Deep Reinforcement Learning Agent for Families of Text-Based Games

Related Posts

Subscribe to Updates