Browsing: Hugging Face
Reinforcement Learning (RL) has shown remarkable success in enhancing the reasoning capabilities of Large Language Models (LLMs). Process-Supervised RL (PSRL)…
Does RL teach LLMs genuinely new skills, or does it merely activate existing ones? This question lies at the core…
We present MGM-Omni, a unified Omni LLM for omni-modal understanding and expressive, long-horizon speech generation. Unlike cascaded pipelines that isolate…
Large Language Models (LLMs) today are powerful problem solvers across many domains, and they continue to get stronger as they…
Vision language models (VLMs) achieve unified modeling of images and text, enabling them to accomplish complex real-world tasks through perception,…
Streaming video generation, as one fundamental component in interactive world models and neural game engines, aims to generate high-quality, low-latency,…
Large language model (LLM) steering has emerged as a promising paradigm for controlling model behavior at inference time through targeted…
We introduce SIRI, Scaling Iterative Reinforcement Learning with Interleaved Compression, a simple yet effective RL approach for Large Reasoning Models…
Structured images (e.g., charts and geometric diagrams) remain challenging for multimodal large language models (MLLMs), as perceptual slips can cascade…
Reinforcement learning based post-training has recently emerged as a powerful paradigm for enhancing the alignment and reasoning capabilities of multimodal…