Browsing: Hugging Face
3D content generation has recently attracted significant research interest due to its applications in VR/AR and embodied AI. In this…
Parametric body models offer expressive 3D representation of humans across a wide range of poses, shapes, and facial expressions, typically…
We present Waver, a high-performance foundation model for unified image and video generation. Waver can directly generate videos with durations…
In recent years, with the rapid development of the depth and breadth of large language models’ capabilities, various corresponding evaluation…
Process Reward Models (PRMs) have emerged as a promising framework for supervising intermediate reasoning in large language models (LLMs), yet…
Understanding videos requires more than answering open ended questions, it demands the ability to pinpoint when events occur and how…
Large Language Models (LLMs) have shown promise for financial applications, yet their suitability for this high-stakes domain remains largely unproven…
We introduce Tinker, a versatile framework for high-fidelity 3D editing that operates in both one-shot and few-shot regimes without any…
MeshCoder reconstructs complex 3D objects from point clouds into editable Blender Python scripts, enhancing shape-to-code reconstruction and 3D shape understanding…
Recent advances in diffusion large language models (dLLMs) have introduced a promising alternative to autoregressive (AR) LLMs for natural language…