U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking - Takara TLDR

Over the past decade, U-Net has been the dominant architecture in medical
image segmentation, leading to the development of thousands of U-shaped
variants. Despite its widespread adoption, there is still no comprehensive
benchmark to systematically evaluate their performance and utility, largely
because of insufficient statistical validation and limited consideration of
efficiency and generalization across diverse datasets. To bridge this gap, we
present U-Bench, the first large-scale, statistically rigorous benchmark that
evaluates 100 U-Net variants across 28 datasets and 10 imaging modalities. Our
contributions are threefold: (1) Comprehensive Evaluation: U-Bench evaluates
models along three key dimensions: statistical robustness, zero-shot
generalization, and computational efficiency. We introduce a novel metric,
U-Score, which jointly captures the performance-efficiency trade-off, offering
a deployment-oriented perspective on model progress. (2) Systematic Analysis
and Model Selection Guidance: We summarize key findings from the large-scale
evaluation and systematically analyze the impact of dataset characteristics and
architectural paradigms on model performance. Based on these insights, we
propose a model advisor agent to guide researchers in selecting the most
suitable models for specific datasets and tasks. (3) Public Availability: We
provide all code, models, protocols, and weights, enabling the community to
reproduce our results and extend the benchmark with future methods. In summary,
U-Bench not only exposes gaps in previous evaluations but also establishes a
foundation for fair, reproducible, and practically relevant benchmarking in the
next decade of U-Net-based segmentation models. The project can be accessed at:
https://fenghetan9.github.io/ubench. Code is available at:
https://github.com/FengheTan9/U-Bench.
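
The abstract names statistical robustness as an evaluation dimension but does not say which test is used. As a rough illustration only, the sketch below runs an exact paired sign-flip permutation test on per-dataset scores of a variant against a baseline. The function name, the choice of test, and the score values are all assumptions made for this example; none of them come from U-Bench.

```python
from itertools import product
from statistics import mean


def paired_sign_flip_p_value(scores_a, scores_b):
    """Two-sided exact sign-flip permutation test on paired scores.

    Hypothetical helper for illustration; not the U-Bench protocol.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("score lists must have the same length")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    observed = abs(mean(diffs))
    extreme = 0
    total = 0
    # Enumerate every way of flipping the sign of each paired difference
    # (2**n assignments, so this is only practical for small n).
    for signs in product((1, -1), repeat=len(diffs)):
        permuted = abs(mean([s * d for s, d in zip(signs, diffs)]))
        # Small tolerance guards against floating-point rounding.
        if permuted >= observed - 1e-12:
            extreme += 1
        total += 1
    return extreme / total


# Made-up Dice scores on six hypothetical datasets (not real results).
variant = [0.82, 0.79, 0.91, 0.75, 0.88, 0.84]
baseline = [0.80, 0.78, 0.88, 0.74, 0.85, 0.83]
p_value = paired_sign_flip_p_value(variant, baseline)
print(f"p = {p_value:.5f}")
```

With only six paired scores, the smallest achievable two-sided p-value is 2/64, which is one reason a benchmark spanning many datasets makes claims of improvement easier to validate.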
