Towards the goal of generalized robotic manipulation, spatial generalization
is the most fundamental capability: the policy must work robustly under
different spatial distributions of objects, the environment, and the agent itself.
Achieving this with imitation learning requires collecting substantial human
demonstrations that cover diverse spatial configurations to train a
generalizable visuomotor policy. Prior works explore a promising direction that
leverages data generation to obtain abundant, spatially diverse data from
minimal source demonstrations. However, most approaches suffer from a
significant sim-to-real gap and are limited to constrained settings, such as
fixed-base scenarios and predefined camera viewpoints. In this paper, we
propose R2RGen, a real-to-real 3D data generation framework that directly
augments point-cloud observation-action pairs to generate real-world data.
R2RGen requires no simulator or rendering, making it efficient and
plug-and-play. Specifically, given a single source demonstration, we introduce
an annotation mechanism for fine-grained parsing of the scene and trajectory.
We then propose a group-wise augmentation strategy to handle complex
multi-object compositions and diverse task constraints, and further present
camera-aware processing to align the distribution of the generated data with
that of a real-world 3D sensor. Extensive experiments show that R2RGen
substantially improves data efficiency and demonstrates strong potential for
scaling and for application to mobile manipulation.
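
The following is a minimal illustrative sketch, not the authors' implementation, of the core idea behind augmenting point-cloud observation-action pairs: a rigid SE(3) transform is applied jointly to a segmented object's points and to the end-effector waypoints that manipulate it, so the generated pair stays geometrically consistent, with a crude range check standing in for camera-aware processing. All function names, parameters, and the visibility filter here are assumptions for illustration.

```python
# Sketch: jointly transform an object's point cloud and the associated
# action waypoints to synthesize a new real-world-style training pair.
import numpy as np


def random_se3(max_xy_shift=0.15, max_yaw=np.pi):
    """Sample a random planar SE(3) transform (yaw rotation + xy translation)."""
    yaw = np.random.uniform(-max_yaw, max_yaw)
    c, s = np.cos(yaw), np.sin(yaw)
    T = np.eye(4)
    T[:3, :3] = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    T[:2, 3] = np.random.uniform(-max_xy_shift, max_xy_shift, size=2)
    return T


def transform_points(T, points):
    """Apply a 4x4 homogeneous transform to an (N, 3) point cloud."""
    return points @ T[:3, :3].T + T[:3, 3]


def transform_poses(T, poses):
    """Left-multiply the same transform onto (K, 4, 4) end-effector poses."""
    return np.einsum("ij,njk->nik", T, poses)


def augment_pair(object_points, ee_poses, camera_origin, max_range=2.0):
    """Generate one augmented observation-action pair from a source demo.

    object_points: (N, 3) segmented points of the manipulated object.
    ee_poses:      (K, 4, 4) end-effector waypoints interacting with it.
    camera_origin: (3,) depth-sensor position, used for a crude stand-in
                   for camera-aware processing (range check only).
    """
    T = random_se3()
    new_points = transform_points(T, object_points)
    new_poses = transform_poses(T, ee_poses)

    # Reject placements that would fall outside the sensor's range
    # (an assumed, simplified visibility criterion).
    dist = np.linalg.norm(new_points - camera_origin, axis=1)
    if dist.max() > max_range:
        return None
    return new_points, new_poses


# Usage: resample until a camera-valid augmentation is produced.
pts = np.random.rand(512, 3) * 0.1 + np.array([0.5, 0.0, 0.0])
poses = np.tile(np.eye(4), (8, 1, 1))
sample = None
while sample is None:
    sample = augment_pair(pts, poses, camera_origin=np.zeros(3))
```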