Browsing: Hugging Face

Hugging Face

Paper page – Vidi: Large Multimodal Models for Video Understanding and Editing

Advanced AI EditorApril 24, 2025

Humans naturally share information with those they are connected to, and video has become one of the dominant mediums for…

Hugging Face

Paper page – WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Advanced AI EditorApril 24, 2025

Can we build accurate world models out of large language models (LLMs)? How can world models benefit LLM agents? The…

Hugging Face

Paper page – CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting

Advanced AI EditorApril 23, 2025

Recognizing and reasoning about occluded (partially or fully hidden) objects is vital to understanding visual scenes, as occlusions frequently occur…

Hugging Face

Paper page – IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property

Advanced AI EditorApril 23, 2025

Intellectual Property (IP) is a unique domain that integrates technical and legal knowledge, making it inherently complex and knowledge-intensive. As…

Hugging Face

Paper page – From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Advanced AI EditorApril 23, 2025

Recent text-to-image diffusion models achieve impressive visual quality through extensive scaling of training data and model parameters, yet they often…

Hugging Face

Paper page – LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Advanced AI EditorApril 23, 2025

Recent video large language models (Video LLMs) often depend on costly human annotations or proprietary model APIs (e.g., GPT-4o) to…

Hugging Face

Paper page – RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild

Advanced AI EditorApril 23, 2025

Controllable character animation remains a challenging problem, particularly in handling rare poses, stylized characters, character-object interactions, complex illumination, and dynamic…

Hugging Face

Paper page – MR. Video: “MapReduce” is the Principle for Long Video Understanding

Advanced AI EditorApril 23, 2025

We propose MR. Video, an agentic long video understanding framework that demonstrates the simple yet effective MapReduce principle for processing…

Hugging Face

Paper page – LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Advanced AI EditorApril 23, 2025

The success of Large Language Models (LLMs) has sparked interest in various agentic applications. A key hypothesis is that LLMs,…

Hugging Face

Paper page – Progent: Programmable Privilege Control for LLM Agents

Advanced AI EditorApril 23, 2025

LLM agents are an emerging form of AI systems where large language models (LLMs) serve as the central component, utilizing…

What's Hot

Buhari, a leader of immense integrity – IBM Haruna

AI Funding Continued Its Hot Streak in February in an Otherwise Dim VC Market

Weaving reality or warping it? The personalization trap in AI systems

Browsing: Hugging Face

Paper page – Vidi: Large Multimodal Models for Video Understanding and Editing

Paper page – WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Paper page – CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting

Paper page – IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property

Paper page – From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Paper page – LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale

Paper page – RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild

Paper page – MR. Video: “MapReduce” is the Principle for Long Video Understanding

Paper page – LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper page – Progent: Programmable Privilege Control for LLM Agents

Sam Gilliam Foundation, David Kordansky Sued Over ‘Disavowed’ Painting

Donors Reportedly Pulling Support from Florida University Museum after its Controversial Transfer

What will come of the Guggenheim Asher legal battle?

Painter Says DHS Stole His Work for Post About ‘Homeland’s Heritage’

Buhari, a leader of immense integrity – IBM Haruna

AI Funding Continued Its Hot Streak in February in an Otherwise Dim VC Market

Weaving reality or warping it? The personalization trap in AI systems

What's Hot

Browsing: Hugging Face

Subscribe to Updates