Browsing: Hugging Face
3D part amodal segmentation (decomposing a 3D shape into complete, semantically meaningful parts, even when occluded) is a challenging but crucial task…
Mixture-of-Experts (MoE) Large Language Models (LLMs) suffer from severely sub-optimal expert pathways: our study reveals that naive expert selection learned from…
Recent progress in diffusion models significantly advances various image generation tasks. However, the current mainstream approach remains focused on building…
In this paper, we present an effective method to enhance visual reasoning with significantly fewer training samples, relying purely on…
We present a novel, open-source social network simulation framework, MOSAIC, where generative language agents predict user behaviors such as liking,…
Despite the existing evolution of Multimodal Large Language Models (MLLMs), a non-negligible limitation remains in their struggle with visual text…
We find that the response length of reasoning LLMs, whether trained by reinforcement learning or supervised learning, drastically increases for…
We release OLMoTrace, a tool that lets you trace the outputs of language models back to their full, multi-trillion-token training…
Reasoning has emerged as the next major frontier for language models (LMs), with rapid advances from both academic and industrial…
Creating a realistic animatable avatar from a single static portrait remains challenging. Existing approaches often struggle to capture subtle facial…