HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data for Mobile Dexterous Manipulation - Takara TLDR

Leveraging human motion data to impart versatile manipulation skills to robots
has emerged as a promising paradigm in robotic manipulation.
Nevertheless, translating multi-source human hand motions into feasible robot
behaviors remains challenging, particularly for robots equipped with
multi-fingered dexterous hands characterized by complex, high-dimensional
action spaces. Moreover, existing approaches often struggle to produce policies
capable of adapting to diverse environmental conditions. In this paper, we
introduce HERMES, a human-to-robot learning framework for mobile bimanual
dexterous manipulation. First, HERMES formulates a unified reinforcement
learning approach capable of seamlessly transforming heterogeneous human hand
motions from multiple sources into physically plausible robotic behaviors.
Subsequently, to mitigate the sim2real gap, we devise an end-to-end, depth
image-based sim2real transfer method for improved generalization to real-world
scenarios. Furthermore, to enable autonomous operation in varied and
unstructured environments, we augment a navigation foundation model with a
closed-loop Perspective-n-Point (PnP) localization mechanism, ensuring precise
alignment of visual goals and effectively bridging autonomous navigation and
dexterous manipulation. Extensive experimental results demonstrate that HERMES
consistently exhibits generalizable behaviors across diverse, in-the-wild
scenarios, successfully performing numerous complex mobile bimanual dexterous
manipulation tasks. Project page: https://gemcollector.github.io/HERMES/
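
The closed-loop localization idea mentioned in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: it assumes a planar base pose `(x, y, yaw)`, replaces the PnP estimate with a hypothetical noise-free `pose_error` function, and uses a simple proportional correction with an assumed gain and tolerances. In a real system the error would come from a PnP solver run on matched image features, not from known poses.

```python
import math

def pose_error(current, goal):
    """Hypothetical stand-in for a PnP estimate: returns (dx, dy, dyaw)
    from the current pose to the goal pose, with yaw wrapped to [-pi, pi)."""
    dx = goal[0] - current[0]
    dy = goal[1] - current[1]
    dyaw = (goal[2] - current[2] + math.pi) % (2 * math.pi) - math.pi
    return dx, dy, dyaw

def closed_loop_align(start, goal, gain=0.5, pos_tol=0.01, yaw_tol=0.02,
                      max_steps=100):
    """Re-estimate the pose error and correct a fraction of it each step,
    stopping once position and yaw errors fall below the tolerances.
    Returns the final pose and the number of correction steps taken."""
    pose = list(start)
    for step in range(max_steps):
        dx, dy, dyaw = pose_error(pose, goal)  # "measure" (PnP in practice)
        if math.hypot(dx, dy) < pos_tol and abs(dyaw) < yaw_tol:
            return tuple(pose), step
        # Proportional correction: move part of the way toward the goal.
        pose[0] += gain * dx
        pose[1] += gain * dy
        pose[2] += gain * dyaw
    return tuple(pose), max_steps

if __name__ == "__main__":
    final_pose, steps = closed_loop_align((0.0, 0.0, 0.0), (1.0, 0.5, 0.3))
    print(final_pose, steps)
```

The point of the loop is that the error is re-measured after every motion, so inaccuracies in a single estimate or a single move do not accumulate the way they would in an open-loop, one-shot alignment.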
