Paper Page - Music Arena: Live Evaluation For Text-to-Music

Music Arena provides a scalable, interactive platform for evaluating text-to-music models through user-generated preferences and detailed feedback.

We present Music Arena, an open platform for scalable human preference
evaluation of text-to-music (TTM) models. Soliciting human preferences via
listening studies is the gold standard for evaluation in TTM, but these studies
are expensive to conduct and difficult to compare, as study protocols may
differ across systems. Moreover, human preferences might help researchers align
their TTM systems or improve automatic evaluation metrics, but an open and
renewable source of preferences does not currently exist. We aim to fill these
gaps by offering *live* evaluation for TTM. In Music Arena, real-world users
input text prompts of their choosing and compare outputs from two TTM systems,
and their preferences are used to compile a leaderboard. While Music Arena
follows recent evaluation trends in other AI domains, we also design it with
key features tailored to music: an LLM-based routing system to navigate the
heterogeneous type signatures of TTM systems, and the collection of *detailed*
preferences including listening data and natural language feedback. We also
propose a rolling data release policy with user privacy guarantees, providing a
renewable source of preference data and increasing platform transparency.
Through its standardized evaluation protocol, transparent data access policies,
and music-specific features, Music Arena not only addresses key challenges in
the TTM ecosystem but also demonstrates how live evaluation can be thoughtfully
adapted to unique characteristics of specific AI domains.
Music Arena is available at: https://music-arena.org

Source link

What's Hot

India leads the way on Google’s Nano Banana with a local creative twist

SAIC Roewe M7 DMH Launches with Doubao Large Model, Starting at 85,800 Yuan for a New Interactive Smart Travel Experience_world’s_model_the

OpenAI, Google reasoning models achieve gold-level scores in ICPC coding contest

Paper page – Music Arena: Live Evaluation for Text-to-Music

Research Paper – Takara TLDR

2D Gaussian Splatting with Semantic Alignment for Image Inpainting – Takara TLDR

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward – Takara TLDR

Jennifer Packer and Marie Watt Win $250,000 Heinz Award

KAWS Named Uniqlo’s First Artist-in-Residence

Jeffrey Gibson Talks About Animals at Unveiling of New Sculptures at the Met

‘New Yorker’ Commissions High-Profile Artists for Anniversary Covers

India leads the way on Google’s Nano Banana with a local creative twist

SAIC Roewe M7 DMH Launches with Doubao Large Model, Starting at 85,800 Yuan for a New Interactive Smart Travel Experience_world’s_model_the

OpenAI, Google reasoning models achieve gold-level scores in ICPC coding contest

What's Hot

Paper page – Music Arena: Live Evaluation for Text-to-Music

Related Posts

Subscribe to Updates