Browsing: Hugging Face
Molecular structure elucidation from spectra is a foundational problem in chemistry, with profound implications for compound identification, synthesis, and drug…
Nova Premier is Amazon’s most capable multimodal foundation model and teacher for model distillation. It processes text, images, and video…
State-of-the-art large multi-modal models (LMMs) face challenges when processing high-resolution images, as these inputs are converted into enormous visual tokens,…
The paper reviews recent studies on memorization in Large Language Models, exploring factors that influence memorization, detection methodologies, and mitigation…
Tora2 enhances motion-guided video generation by introducing a decoupled personalization extractor, gated self-attention mechanism, and contrastive loss, enabling simultaneous multi-entity…
LLM-based web agents have recently made significant progress, but much of it hasoccurred in closed-source systems—widening the gap with open-source…
A large-scale dataset and verification tool are introduced for assessing and improving cross-disciplinary reasoning capabilities in multimodal models. In this…
DreamVLA improves robot manipulation through a VLA framework that incorporates world knowledge, dynamic-region guidance, and a diffusion-based transformer to ensure…
RefineX is a scalable framework for improving the quality of large language model pre-training data through programmatic editing, yielding better…
Tencent Hunyuan Releases ArtifactsBench: A Next-Generation “What-You-See-Is-What-You-Get” Evaluation Standard for Code Generation ArtifactsBench is designed to comprehensively measure large language…