Browsing: Hugging Face

Precise Action-to-Video Generation Through Visual Action Prompts – Takara TLDR

Advanced AI Editor | August 19, 2025

We present visual action prompts, a unified action representation for action-to-video generation of complex high-DoF interactions while maintaining transferable visual…

Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models – Takara TLDR

Advanced AI Editor | August 19, 2025

Video relighting is a challenging yet valuable task, aiming to replace the background in videos while correspondingly adjusting the lighting…

PaperRegister: Boosting Flexible-grained Paper Search via Hierarchical Register Indexing – Takara TLDR

Advanced AI Editor | August 18, 2025

Paper search is an important activity for researchers, typically involving using a query with description of a topic to find…

StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation – Takara TLDR

Advanced AI Editor | August 18, 2025

We introduce StyleMM, a novel framework that can construct a stylized 3D Morphable Model (3DMM) based on user-defined text descriptions…

Controlling Multimodal LLMs via Reward-guided Decoding – Takara TLDR

Advanced AI Editor | August 18, 2025

As Multimodal Large Language Models (MLLMs) gain widespread applicability, it is becoming increasingly desirable to adapt them for diverse user…

SPARSE Data, Rich Results: Few-Shot Semi-Supervised Learning via Class-Conditioned Image Translation – Takara TLDR

Advanced AI Editor | August 18, 2025

Deep learning has revolutionized medical imaging, but its effectiveness is severely limited by insufficient labeled training data. This paper introduces…

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning – Takara TLDR

Advanced AI Editor | August 15, 2025

Multimodal Large Language Models (MLLMs) have demonstrated impressive capabilities across various tasks, but still struggle with complex mathematical reasoning. Existing…

PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts

Advanced AI Editor | August 15, 2025

A benchmark called PRELUDE evaluates long-context understanding by assessing the consistency of prequel stories with original books, revealing significant challenges…

HumanSense: From Multimodal Perception to Empathetic Context-Aware Responses through Reasoning MLLMs – Takara TLDR

Advanced AI Editor | August 15, 2025

While Multimodal Large Language Models (MLLMs) show immense promise for achieving truly human-like interactions, progress is hindered by the lack…

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models – Takara TLDR

Advanced AI Editor | August 15, 2025

Reinforcement learning with verifiable rewards (RLVR), which typically adopts Pass@1 as the reward, has faced the issues in balancing exploration…
