Browsing: Hugging Face
Effective reasoning is crucial to solving complex mathematical problems. Recent large language models (LLMs) have boosted performance by scaling test-time…
Graphical User Interface (GUI) agents offer cross-platform solutions for automating complex digital tasks, with significant potential to transform productivity workflows.…
Scientific equation discovery is a fundamental task in the history of scientific progress, enabling the derivation of laws governing natural…
Overview of EmoEval for Evaluating Mental Safety of AI-human Interactions. The simulation consists of four steps: (1) User Agent Initialization…
World modeling is a crucial task for enabling intelligent agents to effectively interact with humans and operate in dynamic environments.…
Natural Language to SQL (NL2SQL) enables intuitive interactions with databases by transforming natural language queries into structured SQL statements. Despite…
We propose a new problem, In-2-4D, for generative 4D (i.e., 3D + motion) inbetweening from a minimalistic input setting: two…
Building general-purpose models that can effectively perceive the world through multimodal signals has been a long-standing goal. Current approaches involve…
Existing approaches for controlling text-to-image diffusion models, while powerful, do not allow for explicit 3D object-centric control, such as precise…
Current monocular 3D detectors are held back by the limited diversity and scale of real-world datasets. While data augmentation certainly…