Browsing: Hugging Face
Vision-Language-Action (VLA) models have shown impressive capabilities across a wide range of robotics manipulation tasks. However, their growing model size…
Dreamland, a hybrid framework, combines physics-based simulators and generative models to improve controllability and image quality in video generation. Large-scale…
The paper introduces γ-PO, a dynamic target margin preference optimization algorithm that enhances Large Language Models’ alignment by adjusting reward…
GUI-Reflection enhances GUI automation by integrating self-reflection and error correction through scalable data pipelines and an iterative online tuning framework.…
Code & Resources: https://github.com/F2-Song/Weak-to-Strong-Decoding Large Language Models (LLMs) require alignment with human preferences to avoid generating offensive, false, or meaningless…
Large language models (LLMs) frequently refuse to respond to pseudo-malicious instructions: semantically harmless input queries triggering unnecessary LLM refusals due…
Foundational to the Chinese language and culture, Chinese characters encompass extraordinarily extensive and ever-expanding categories, with the latest Chinese GB18030-2022…
Recently, techniques such as explicit structured reasoning have demonstrated strong test-time scaling behavior by enforcing a separation between the model’s…
A new reinforcement learning framework, Group Contrastive Policy Optimization (GCPO), enhances geometric reasoning in large language models with judicious auxiliary…
A novel two-stage pipeline using specialized pretrained models and a large language model enhances audio caption quality by integrating diverse…