Do You Need Proprioceptive States In Visuomotor Policies? - Takara TLDR

Imitation-learning-based visuomotor policies have been widely used in robot
manipulation, where both visual observations and proprioceptive states are
typically adopted together for precise control. However, in this study, we find
that this common practice makes the policy overly reliant on the proprioceptive
state input, which causes overfitting to the training trajectories and results
in poor spatial generalization. In contrast, we propose the State-free
Policy, which removes the proprioceptive state input and predicts actions
conditioned only on visual observations. The State-free Policy is built in the
relative end-effector action space and requires visual observations that cover
everything relevant to the task, here provided by dual wide-angle wrist cameras. Empirical
results demonstrate that the State-free Policy achieves significantly stronger
spatial generalization than the state-based policy: in real-world tasks such as
pick-and-place, challenging shirt-folding, and complex whole-body manipulation,
spanning multiple robot embodiments, the average success rate improves from 0%
to 85% in height generalization and from 6% to 64% in horizontal
generalization. Furthermore, the State-free Policy also shows advantages in data
efficiency and cross-embodiment adaptation, enhancing its practicality for
real-world deployment.
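
To illustrate why a relative end-effector action space pairs naturally with dropping the state input, here is a minimal sketch, not the authors' implementation. It assumes position-only actions expressed as per-step deltas; the helper names `to_relative_actions` and `build_observation` and the sample trajectory are hypothetical. It shows that a demonstration shifted to a different height yields the same relative actions, so a policy that sees only images is not tied to the absolute poses of the training trajectories.

```python
import numpy as np


def to_relative_actions(ee_positions: np.ndarray) -> np.ndarray:
    """Convert absolute end-effector positions (T, 3) into per-step deltas (T-1, 3).

    Assumption: actions are position-only deltas; rotation and gripper are omitted.
    """
    return np.diff(ee_positions, axis=0)


def build_observation(wrist_images, ee_state=None, use_state=False):
    """Assemble the policy input (hypothetical helper).

    With use_state=False the proprioceptive state is left out entirely,
    so the policy is conditioned on visual observations only.
    """
    obs = {"images": wrist_images}
    if use_state:
        obs["state"] = ee_state
    return obs


# Hypothetical demonstration: absolute end-effector positions in metres.
demo = np.array([
    [0.30, 0.00, 0.10],
    [0.32, 0.01, 0.12],
    [0.35, 0.01, 0.15],
])

# The same motion performed 20 cm higher (e.g. on a taller table).
shifted = demo + np.array([0.0, 0.0, 0.20])

rel_demo = to_relative_actions(demo)
rel_shifted = to_relative_actions(shifted)

# Relative actions are unchanged by the constant offset.
print(np.allclose(rel_demo, rel_shifted))

# State-free input: only the (placeholder) wrist camera images are passed in.
obs = build_observation([np.zeros((2, 2, 3)), np.zeros((2, 2, 3))])
print(sorted(obs.keys()))
```

Absolute targets, by contrast, would differ between the two trajectories by the full 20 cm offset, which is one plausible reading of why the abstract ties the State-free Policy to the relative action space.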
