Browsing: Hugging Face

Hugging Face

Paper page – Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study

Advanced AI EditorMay 27, 2025

VLMs are more vulnerable to harmful meme-based prompts than to synthetic images, and while multi-turn interactions offer some protection, significant…

Hugging Face

Paper page – QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Advanced AI EditorMay 27, 2025

A framework called QwenLong-L1 enhances large reasoning models for long-context reasoning through reinforcement learning, achieving leading performance on document question-answering…

Hugging Face

Paper page – Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Advanced AI EditorMay 26, 2025

The Transformer Copilot framework enhances large language model performance through a Copilot model that refines the Pilot’s logits based on…

Hugging Face

Paper page – On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Advanced AI EditorMay 26, 2025

Policy gradient algorithms have been successfully applied to enhance the reasoning capabilities of large language models (LLMs). Despite the widespread…

Hugging Face

Paper page – Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks

Advanced AI EditorMay 26, 2025

Orthogonal Residual Updates enhance feature learning and training stability by decomposing module outputs to contribute primarily novel features. Residual connections…

Hugging Face

Paper page – Synthetic Data RL: Task Definition Is All You Need

Advanced AI EditorMay 26, 2025

Synthetic Data RL enhances foundation models through reinforcement learning using only synthetic data, achieving performance comparable to models trained with…

Hugging Face

Paper page – TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

Advanced AI EditorMay 26, 2025

Temporal reasoning is pivotal for Large Language Models (LLMs) to comprehend the real world. However, existing works neglect the real-world…

Hugging Face

Paper page – Interactive Post-Training for Vision-Language-Action Models

Advanced AI EditorMay 26, 2025

RIPT-VLA is a reinforcement learning-based interactive post-training paradigm that enhances pretrained Vision-Language-Action models using sparse binary success rewards, improving adaptability…

Hugging Face

Paper page – Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Advanced AI EditorMay 26, 2025

A scalable 3D shape generation framework using sparse volumes and spatial sparse attention, enabling high-resolution generation with reduced computational requirements.…

Hugging Face

Paper page – Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

Advanced AI EditorMay 26, 2025

Did you know that fine-tuning retrievers & re-rankers on large but unclean training datasets can harm their performance? 😡 In…

What's Hot

Story, Stability AI collaborate to help creators make money from their work in the AI ecosystem

Basecamp Research leverages Microsoft and Nvidia AI to…

Meta Just Escalated the AI Talent War With OpenAI

Browsing: Hugging Face

Paper page – Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study

Paper page – QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper page – Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

Paper page – On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

Paper page – Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks

Paper page – Synthetic Data RL: Task Definition Is All You Need

Paper page – TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

Paper page – Interactive Post-Training for Vision-Language-Action Models

Paper page – Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Paper page – Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

David Geffen Sued By Estranged Husband for Breach of Contract

Auction House Will Sell Egyptian Artifact Despite Concern From Experts

Anish Kapoor Lists New York Apartment for $17.75 M.

Street Fighter 6 Community Rocked by AI Art Controversy

Story, Stability AI collaborate to help creators make money from their work in the AI ecosystem

Basecamp Research leverages Microsoft and Nvidia AI to…

Meta Just Escalated the AI Talent War With OpenAI

What's Hot

Browsing: Hugging Face

Subscribe to Updates