Browsing: Hugging Face

Hugging Face

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models – Takara TLDR

Advanced AI EditorOctober 1, 2025

Reinforcement Learning (RL) has shown remarkable success in enhancing the reasoning capabilities of Large Language Models (LLMs). Process-Supervised RL (PSRL)…

Hugging Face

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones – Takara TLDR

Advanced AI EditorOctober 1, 2025

Does RL teach LLMs genuinely new skills, or does it merely activate existing ones? This question lies at the core…

Hugging Face

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech – Takara TLDR

Advanced AI EditorOctober 1, 2025

We present MGM-Omni, a unified Omni LLM for omni-modal understanding and expressive, long-horizon speech generation. Unlike cascaded pipelines that isolate…

Hugging Face

Pretraining Large Language Models with NVFP4 – Takara TLDR

Advanced AI EditorSeptember 30, 2025

Large Language Models (LLMs) today are powerful problem solvers across many domains, and they continue to get stronger as they…

Hugging Face

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts – Takara TLDR

Advanced AI EditorSeptember 30, 2025

Vision language models (VLMs) achieve unified modeling of images and text, enabling them to accomplish complex real-world tasks through perception,…

Hugging Face

Rolling Forcing: Autoregressive Long Video Diffusion in Real Time – Takara TLDR

Advanced AI EditorSeptember 30, 2025

Streaming video generation, as one fundamental component in interactive world models and neural game engines, aims to generate high-quality, low-latency,…

Hugging Face

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering – Takara TLDR

Advanced AI EditorSeptember 30, 2025

Large language model (LLM) steering has emerged as a promising paradigm for controlling model behavior at inference time through targeted…

Hugging Face

SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression – Takara TLDR

Advanced AI EditorSeptember 30, 2025

We introduce SIRI, Scaling Iterative Reinforcement Learning with Interleaved Compression, a simple yet effective RL approach for Large Reasoning Models…

Hugging Face

PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images – Takara TLDR

Advanced AI EditorSeptember 30, 2025

Structured images (e.g., charts and geometric diagrams) remain challenging for multimodal large language models (MLLMs), as perceptual slips can cascade…

Hugging Face

Visual Jigsaw Post-Training Improves MLLMs – Takara TLDR

Advanced AI EditorSeptember 30, 2025

Reinforcement learning based post-training has recently emerged as a powerful paradigm for enhancing the alignment and reasoning capabilities of multimodal…

What's Hot

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness – Takara TLDR

Samsung Electronics, SK Hynix Shares Soar On OpenAI’s Korean Data Center Push

Tesla Optimus is learning martial arts in new video teasing capabilities

Browsing: Hugging Face

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models – Takara TLDR

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones – Takara TLDR

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech – Takara TLDR

Pretraining Large Language Models with NVFP4 – Takara TLDR

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts – Takara TLDR

Rolling Forcing: Autoregressive Long Video Diffusion in Real Time – Takara TLDR

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering – Takara TLDR

SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression – Takara TLDR

PixelCraft: A Multi-Agent System for High-Fidelity Visual Reasoning on Structured Images – Takara TLDR

Visual Jigsaw Post-Training Improves MLLMs – Takara TLDR

Record Exec and Art Collector Gets Over 4 Years

Chicago’s Art Scene Offers a Beacon of Hope for Artists and Dealers

Pace to Close Hong Kong Gallery at H Queen’s This Month

Taylor Swift’s ‘Fate of Ophelia’ Has a Lot in Common with This Artwork

Just Do It!? Computer-Use Agents Exhibit Blind Goal-Directedness – Takara TLDR

Samsung Electronics, SK Hynix Shares Soar On OpenAI’s Korean Data Center Push

Tesla Optimus is learning martial arts in new video teasing capabilities

What's Hot

Browsing: Hugging Face

Subscribe to Updates