Browsing: Hugging Face
Sel3DCraft enhances text-to-3D generation through a dual-branch retrieval and generation system, multi-view hybrid scoring with MLLMs, and prompt-driven visual analytics,…
SonicMaster, a unified generative model, improves music audio quality by addressing various artifacts using text-based control and a flow-matching generative…
A benchmark and model for 3D occupancy grounding using natural language and voxel-level annotations improve object perception in autonomous driving.…
OpenMed NER, a suite of open-source transformer models using DAPT and LoRA, achieves state-of-the-art performance on diverse biomedical NER benchmarks…
A principled framework that structurally decomposes LoRA fine-tuning updates into alignment-critical and task-specific components using Fisher Information and geodesic constraints,…
Cyber-Zero synthesizes agent trajectories from CTF writeups to train runtime-free cybersecurity LLMs, achieving state-of-the-art performance on benchmarks. Large Language Models…
Meta-reinforcement learning agents can exhibit exploratory behavior when trained with a greedy objective, provided the environment has recurring structure, the…
AgentTTS, an LLM-agent-based framework, optimizes compute allocation for multi-stage complex tasks, improving performance and robustness compared to traditional methods. Test-time…
SWE-Debate, a competitive multi-agent framework, enhances issue resolution in software engineering by promoting diverse reasoning and achieving better issue localization…
A survey of multimodal referring segmentation techniques, covering advancements in convolutional neural networks, transformers, and large language models for segmenting…