RGB-Only Supervised Camera Parameter Optimization In Dynamic Scenes - Takara TLDR

Although COLMAP has long remained the predominant method for camera parameter
optimization in static scenes, it is constrained by its lengthy runtime and
reliance on ground truth (GT) motion masks for application to dynamic scenes.
Many efforts attempted to improve it by incorporating more priors as
supervision such as GT focal length, motion masks, 3D point clouds, camera
poses, and metric depth, which, however, are typically unavailable in casually
captured RGB videos. In this paper, we propose a novel method for more accurate
and efficient camera parameter optimization in dynamic scenes solely supervised
by a single RGB video, dubbed ROS-Cam. Our method consists of three key
components: (1) Patch-wise Tracking Filters, to establish robust and maximally
sparse hinge-like relations across the RGB video. (2) Outlier-aware Joint
Optimization, for efficient camera parameter optimization by adaptive
down-weighting of moving outliers, without reliance on motion priors. (3) A
Two-stage Optimization Strategy, to enhance stability and optimization speed by
a trade-off between the Softplus limits and convex minima in losses. We
visually and numerically evaluate our camera estimates. To further validate
accuracy, we feed the camera estimates into a 4D reconstruction method and
assess the resulting 3D scenes, and rendered 2D RGB and depth maps. We perform
experiments on 4 real-world datasets (NeRF-DS, DAVIS, iPhone, and TUM-dynamics)
and 1 synthetic dataset (MPI-Sintel), demonstrating that our method estimates
camera parameters more efficiently and accurately with a single RGB video as
the only supervision.

Source link

What's Hot

Google DeepMind expands frontier AI safety framework to counter manipulation and shutdown risks

OPM adds OpenAI to its employees’ computers

Can AI Help Invent the Next Superconductor? MIT and Samsung Researchers Think So

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes – Takara TLDR

BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent – Takara TLDR

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification – Takara TLDR

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning – Takara TLDR

St. Patrick’s Cathedral Unveils Monumental Mural by Adam Cvijanovic

Three Loaned Banksy Works Incite Dispute Between England and Italy

Major Collection of Old Masters Paintings Could Be Fractionalized

New Collectors Drive Strong Sales at New York Fair

Google DeepMind expands frontier AI safety framework to counter manipulation and shutdown risks

OPM adds OpenAI to its employees’ computers

Can AI Help Invent the Next Superconductor? MIT and Samsung Researchers Think So

What's Hot

RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes – Takara TLDR

Related Posts

Subscribe to Updates