Browsing: Hugging Face

Hugging Face

Paper page – MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Advanced AI EditorApril 29, 2025

The integration of long-context capabilities with visual understanding unlocks unprecedented potential for Vision Language Models (VLMs). However, the quadratic attention…

Hugging Face

Paper page – Clinical knowledge in LLMs does not translate to human interactions

Advanced AI EditorApril 29, 2025

Global healthcare providers are exploring use of large language models (LLMs) to provide medical advice to the public. LLMs now…

Hugging Face

Paper page – Towards Understanding Camera Motions in Any Video

Advanced AI EditorApril 28, 2025

We introduce CameraBench, a large-scale dataset and benchmark designed to assess and improve camera motion understanding. CameraBench consists of ~3,000…

Hugging Face

Paper page – DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency

Advanced AI EditorApril 28, 2025

Given a single labeled example, in-context segmentation aims to segment corresponding objects. This setting, known as one-shot segmentation in few-shot…

Hugging Face

Paper page – Interpretable non-linear dimensionality reduction using gaussian weighted linear transformation

Advanced AI EditorApril 26, 2025

Dimensionality reduction techniques are fundamental for analyzing and visualizing high-dimensional data. With established methods like t-SNE and PCA presenting a…

Hugging Face

Paper page – Step1X-Edit: A Practical Framework for General Image Editing

Advanced AI EditorApril 25, 2025

In recent years, image editing models have witnessed remarkable and rapid development. The recent unveiling of cutting-edge multimodal models such…

Hugging Face

Paper page – Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Advanced AI EditorApril 25, 2025

Despite the rapid growth of machine learning research, corresponding code implementations are often unavailable, making it slow and labor-intensive for…

Hugging Face

Paper page – Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Advanced AI EditorApril 25, 2025

The Contrastive Language-Image Pre-training (CLIP) framework has become a widely used approach for multimodal representation learning, particularly in image-text retrieval…

Hugging Face

Paper page – Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation

Advanced AI EditorApril 25, 2025

Visit our project page at: https://apc-vlm.github.io/ 🙂 Abstract:We present a framework for perspective-aware reasoning in vision-language models (VLMs) through mental…

Hugging Face

Paper page – Distilling semantically aware orders for autoregressive image generation

Advanced AI EditorApril 25, 2025

Autoregressive patch-based image generation has recently shown competitive results in terms of image quality and scalability. It can also be…

What's Hot

Perplexity Plans to Bring Comet AI Browser to Smartphones

Hey you, AI algorithm! Explain yourself!

IBM launches global entrance test for MBA, MCA, MSc admissions | Bengaluru News

Browsing: Hugging Face

Paper page – MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Paper page – Clinical knowledge in LLMs does not translate to human interactions

Paper page – Towards Understanding Camera Motions in Any Video

Paper page – DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency

Paper page – Interpretable non-linear dimensionality reduction using gaussian weighted linear transformation

Paper page – Step1X-Edit: A Practical Framework for General Image Editing

Paper page – Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper page – Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper page – Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation

Paper page – Distilling semantically aware orders for autoregressive image generation

Sam Gilliam Foundation, David Kordansky Sued Over ‘Disavowed’ Painting

Donors Reportedly Pulling Support from Florida University Museum after its Controversial Transfer

What will come of the Guggenheim Asher legal battle?

Painter Says DHS Stole His Work for Post About ‘Homeland’s Heritage’

Perplexity Plans to Bring Comet AI Browser to Smartphones

Hey you, AI algorithm! Explain yourself!

IBM launches global entrance test for MBA, MCA, MSc admissions | Bengaluru News

What's Hot

Browsing: Hugging Face

Subscribe to Updates