Diffusion LLMs have emerged as a promising alternative to conventional
autoregressive LLMs, offering significant potential for improved runtime
efficiency. However, existing diffusion models lack the ability to provably
enforce user-specified formal constraints, such as regular expressions, which
makes them unreliable for tasks that require structured outputs, such as
fixed-schema JSON generation. Unlike autoregressive models that generate tokens
sequentially, diffusion LLMs predict a block of tokens in parallel. This
parallelism makes traditional constrained decoding algorithms, which are
designed for sequential token prediction, ineffective at preserving the true
output distribution. To address this limitation, we propose DINGO, a dynamic
programming-based constrained decoding strategy that is both efficient and
provably distribution-preserving. DINGO enables sampling of output strings with
the highest probability under the model’s predicted distribution, while
strictly satisfying any user-specified regular expression. On standard symbolic
math and JSON generation benchmarks, DINGO achieves up to a 68-percentage-point
improvement over unconstrained inference.
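
To see why sequential constrained decoding transfers poorly to parallel prediction, consider a toy block. The sketch below is our illustration, not the paper's code: the regex, two-token vocabulary, and per-position probabilities are made up, and the block's per-position distributions are assumed independent. Each position's locally most probable token can be individually plausible while the joint string violates the regex, and per-position masking cannot help, because the set of valid tokens at one position depends on what is chosen at the others.

```python
import re

import numpy as np

VOCAB = ["a", "b"]
PATTERN = re.compile(r"a*b*")  # toy regex that couples all positions jointly

# Hypothetical per-position distributions for a 4-token block, as one
# parallel denoising step might emit them (made-up numbers, not model output).
probs = np.array([[0.3, 0.7],    # position 0 prefers "b"
                  [0.6, 0.4],    # position 1 prefers "a"
                  [0.45, 0.55],
                  [0.2, 0.8]])

# Decoding each position independently ignores the joint constraint:
# every token is locally the argmax, yet the string is rejected.
decoded = "".join(VOCAB[i] for i in probs.argmax(axis=1))
print(decoded, "->", "valid" if PATTERN.fullmatch(decoded) else "INVALID")
# prints: babb -> INVALID  (an "a" after a "b" breaks a*b*)
```

An autoregressive masker would have forbidden the "a" at position 1 only after committing to the "b" at position 0; with all four positions predicted at once, there is no generation order to exploit.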
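The shape of the dynamic program can likewise be sketched. What follows is a minimal, hedged illustration of the kind of DP the abstract describes, not DINGO's actual implementation: it assumes the regular expression has been compiled to a DFA given as a transition table, keeps the independence assumption across block positions, and recovers the single highest-probability string of a fixed length that the DFA accepts via a Viterbi-style sweep over (position, DFA state) pairs. The function name, toy DFA, and distributions are ours.

```python
import numpy as np

VOCAB = ["a", "b"]
# DFA for the toy regex a*b*: state 0 = "still reading a's",
# state 1 = "switched to b's"; a missing transition is the dead state.
DELTA = {0: {"a": 0, "b": 1}, 1: {"b": 1}}
START, ACCEPT = 0, {0, 1}

def constrained_argmax(logp):
    """Viterbi-style DP over (position, DFA state): return the
    highest-probability length-L string the DFA accepts, or None
    if it accepts no string of that length."""
    L = logp.shape[0]
    best = {START: 0.0}   # DFA state -> best prefix log-probability
    back = []             # per position: state -> (previous state, token)
    for i in range(L):
        nxt, ptr = {}, {}
        for s, lp in best.items():
            for t, tok in enumerate(VOCAB):
                s2 = DELTA[s].get(tok)
                if s2 is None:
                    continue  # would enter the dead state: prune
                cand = lp + logp[i, t]
                if cand > nxt.get(s2, -np.inf):
                    nxt[s2], ptr[s2] = cand, (s, tok)
        back.append(ptr)
        best = nxt
    final = max((s for s in best if s in ACCEPT), key=best.get, default=None)
    if final is None:
        return None
    out, s = [], final
    for ptr in reversed(back):     # walk backpointers to recover the string
        s, tok = ptr[s]
        out.append(tok)
    return "".join(reversed(out))

# Same per-position distributions as above: greedy decoding gave "babb",
# which a*b* rejects.
probs = np.array([[0.3, 0.7], [0.6, 0.4], [0.45, 0.55], [0.2, 0.8]])
print(constrained_argmax(np.log(probs)))  # "bbbb", the best accepted string
```

For this sketch the sweep costs O(L · |Q| · |V|) for block length L, DFA states Q, and vocabulary V, which is what makes a provably constraint-satisfying argmax tractable; DINGO's actual algorithm and its distribution-preservation guarantee are given in the paper.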