We propose a unified framework that integrates object detection (OD) and
visual grounding (VG) for remote sensing (RS) imagery. To support conventional
OD and establish an intuitive prior for the VG task, we fine-tune an open-set
object detector using referring expression data, framing it as a partially
supervised OD task. In the first stage, we construct a graph representation of
each image, comprising object queries, class embeddings, and proposal
locations. Then, our task-aware architecture processes this graph to perform
the VG task.
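Concretely, the graph can be pictured as a set of proposal nodes whose features come from the detector. The sketch below is a minimal illustration under assumed names and shapes (e.g., a 100-proposal budget, 256-d embeddings), not the paper's exact interface.

```python
# Minimal sketch of the stage-one graph for a single image. Shapes, names,
# and the dict layout are illustrative assumptions, not the paper's exact API.
import torch

N, d_vis, d_cls = 100, 256, 256  # assumed proposal budget and feature dims

# Assumed outputs of the fine-tuned open-set detector:
object_queries = torch.randn(N, d_vis)    # per-proposal visual query embeddings
class_embeddings = torch.randn(N, d_cls)  # per-proposal category embeddings
proposal_boxes = torch.rand(N, 4)         # normalized (cx, cy, w, h) locations

# One node per proposal; each node carries the visual, categorical, and
# spatial features that the task-aware architecture consumes downstream.
graph = {
    "visual": object_queries,
    "categorical": class_embeddings,
    "spatial": proposal_boxes,
}
```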
The model consists of: (i) a multi-branch network that integrates
spatial, visual, and categorical features to generate task-aware proposals, and
(ii) an object reasoning network that assigns probabilities across proposals,
followed by a soft selection mechanism for final referring object localization.
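For intuition, a compressed sketch of these two components follows; the referring-expression conditioning is assumed to already be folded into the node features, and all module names, layer sizes, the concatenation-based fusion, and the probability-weighted soft selection are hypothetical choices, not the paper's exact design.

```python
import torch
import torch.nn as nn

class MultiBranchReasoner(nn.Module):
    """Illustrative sketch of (i) multi-branch fusion into task-aware
    proposals and (ii) object reasoning with soft selection. All design
    choices here are assumptions for the sketch."""

    def __init__(self, d_vis=256, d_cls=256, d_model=256):
        super().__init__()
        self.visual_branch = nn.Linear(d_vis, d_model)
        self.class_branch = nn.Linear(d_cls, d_model)
        self.spatial_branch = nn.Linear(4, d_model)
        # Object reasoning head: one logit per proposal node.
        self.reasoner = nn.Sequential(
            nn.Linear(3 * d_model, d_model), nn.ReLU(), nn.Linear(d_model, 1)
        )

    def forward(self, graph):
        # (i) Fuse the three branches into task-aware proposal features.
        fused = torch.cat(
            [
                self.visual_branch(graph["visual"]),
                self.class_branch(graph["categorical"]),
                self.spatial_branch(graph["spatial"]),
            ],
            dim=-1,
        )
        # (ii) Assign a probability to every proposal ...
        probs = self.reasoner(fused).squeeze(-1).softmax(dim=-1)  # (N,)
        # ... then soft-select: a probability-weighted box average keeps
        # the final localization differentiable end to end.
        box = (probs.unsqueeze(-1) * graph["spatial"]).sum(dim=0)  # (cx, cy, w, h)
        return probs, box
```

With the graph from the previous sketch, `probs, box = MultiBranchReasoner()(graph)` yields per-proposal probabilities and a soft-selected box for the referred object.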
Our model demonstrates superior performance on the OPT-RSVG and DIOR-RSVG
datasets, achieving significant improvements over state-of-the-art methods
while retaining conventional OD capabilities. The code will be available in our
repository: https://github.com/rd20karim/MB-ORES.