Browsing: Hugging Face

Hugging Face

Visual Jigsaw Post-Training Improves MLLMs – Takara TLDR

Advanced AI Editor | September 30, 2025

Reinforcement learning based post-training has recently emerged as a powerful paradigm for enhancing the alignment and reasoning capabilities of multimodal…

VGGT-X: When VGGT Meets Dense Novel View Synthesis – Takara TLDR

Advanced AI Editor | September 30, 2025

We study the problem of applying 3D Foundation Models (3DFMs) to dense Novel View Synthesis (NVS). Despite significant progress in…

Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation – Takara TLDR

Advanced AI Editor | September 30, 2025

Multimodal large language models (MLLMs) have demonstrated remarkable capabilities in aligning visual inputs with natural language outputs. Yet, the extent…

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning – Takara TLDR

Advanced AI Editor | September 30, 2025

Training LLM agents in multi-turn environments with sparse rewards, where completing a single task requires 30+ turns of interaction within…

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning – Takara TLDR

Advanced AI Editor | September 30, 2025

Reinforcement learning (RL) is the dominant paradigm for sharpening strategic tool use capabilities of LLMs on long-horizon, sparsely-rewarded agent tasks,…

Quantile Advantage Estimation for Entropy-Safe Reasoning – Takara TLDR

Advanced AI Editor | September 29, 2025

Reinforcement Learning with Verifiable Rewards (RLVR) strengthens LLM reasoning, but training often oscillates between entropy collapse and entropy explosion. We…

LongLive: Real-time Interactive Long Video Generation – Takara TLDR

Advanced AI Editor | September 29, 2025

We present LongLive, a frame-level autoregressive (AR) framework for real-time and interactive long video generation. Long video generation presents challenges…

SPARK: Synergistic Policy And Reward Co-Evolving Framework – Takara TLDR

Advanced AI Editor | September 29, 2025

Recent Large Language Models (LLMs) and Large Vision-Language Models (LVLMs) increasingly use Reinforcement Learning (RL) for post-pretraining, such as RL…

StateX: Enhancing RNN Recall via Post-training State Expansion – Takara TLDR

Advanced AI Editor | September 29, 2025

While Transformer-based models have demonstrated remarkable language modeling performance, their high complexities result in high costs when processing long contexts.…

Variational Reasoning for Language Models – Takara TLDR

Advanced AI Editor | September 29, 2025

We introduce a variational reasoning framework for language models that treats thinking traces as latent variables and optimizes them through…
