Browsing: Hugging Face
As Large Language Models (LLMs) are increasingly applied to document-based tasks – such as document summarization, question answering, and information…
Recent developments in Large Language Models (LLMs) have shifted from pre-training scaling to post-training and test-time scaling. Across these developments,…
First Foundational and Conceptual Survey of VLAs Vision-Language-Action (VLA) models mark a transformative advancement in artificial intelligence, aiming to unify…
Robust and efficient local feature matching plays a crucial role in applications such as SLAM and visual localization for robotics.…
Contrastive Language-Image Pre-training (CLIP) excels in multimodal tasks such as image-text retrieval and zero-shot classification but struggles with fine-grained understanding…
Large language model (LLM) unlearning is critical in real-world applications where it is necessary to efficiently remove the influence of…
Aligning language models with human preferences relies on pairwise preference datasets. While some studies suggest that on-policy data consistently outperforms…
Chain-of-thoughts (CoT) requires large language models (LLMs) to generate intermediate steps before reaching the final answer, and has been proven…
💫 Excited to share our recent work: BrowseComp-ZH, the first high-difficulty benchmark specifically designed to evaluate large language models (LLMs)…
Most existing video anomaly detectors rely solely on RGB frames, which lack the temporal resolution needed to capture abrupt or…