Browsing: Hugging Face
We introduce BitNet b1.58 2B4T, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale. Trained…
Color plays an important role in human perception and usually provides critical clues in visual reasoning. However, it is unclear…
An ideal detection system for machine generated content is supposed to work well on any generator as many more advanced…
The application of diffusion models in 3D LiDAR scene completion is limited due to diffusion’s slow sampling speed. Score distillation…
Reinforcement learning (RL) has become a prevailing approach for fine-tuning large language models (LLMs) on complex reasoning tasks. Among recent…
This method is designed to significantly speed up the previously proposed Forgetting Transformer (FoX) without any performance degradation. FoX adds…
Deyuan Liu1* · Peng Sun1,2* · Xufeng Li1,3 · Tao Lin1† 1 Westlake University 2 Zhejiang University…
Web agents enable users to perform tasks on web browsers through natural language interaction. Evaluating web agents trajectories is an…
We introduce S1-Bench, a novel benchmark designed to evaluate Large Reasoning Models’ (LRMs) performance on simple tasks that favor intuitive…
Recent advances in reinforcement learning (RL)-based post-training have led tonotable improvements in large language models (LLMs), particularly in enhancingtheir reasoning…