Browsing: Hugging Face
Pre-training datasets are typically collected from web content and lack inherent domain divisions. For instance, widely used datasets like Common…
Vision-Language Models (VLMs) excel at visual understanding but often suffer from visual hallucinations, where they generate descriptions of nonexistent objects,…
We introduce Perception Encoder (PE), a state-of-the-art encoder for image and video understanding trained via simple vision-language learning. Traditionally, vision…
Scaling test-time compute has emerged as a key ingredient for enabling large language models (LLMs) to solve difficult problems, but…
Current learning-based subject customization approaches, predominantly relying on U-Net architectures, suffer from limited generalization ability and compromised image quality. Meanwhile,…
Computational color constancy, or white balancing, is a key module in a camera’s image signal processor (ISP) that corrects color…
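For context only: the abstract is cut off before any method details, but the correction it refers to amounts to estimating the scene illuminant and rescaling the color channels to neutralize it. Below is a minimal NumPy sketch of the classic gray-world heuristic, which is not the paper's method and uses a hypothetical function name, purely to illustrate what a white-balance step does.

    import numpy as np

    def gray_world_white_balance(image: np.ndarray) -> np.ndarray:
        """Gray-world white balance for an RGB image with values in [0, 1].

        Assumes the average scene color is neutral gray, so each channel
        is rescaled to share a common mean.
        """
        channel_means = image.reshape(-1, 3).mean(axis=0)      # per-channel mean
        gains = channel_means.mean() / (channel_means + 1e-8)  # pull each channel toward gray
        return np.clip(image * gains, 0.0, 1.0)

    # Example: an image with a warm (reddish) cast gets pulled back toward neutral.
    img = np.random.rand(64, 64, 3) * np.array([1.0, 0.7, 0.6])
    balanced = gray_world_white_balance(img)

In a real ISP this step sits alongside demosaicing and tone mapping, and modern approaches replace the gray-world assumption with learned illuminant estimators.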
Vision-language models are integral to computer vision research, yet many high-performing models remain closed-source, obscuring their data, design and training…
Single-stream architectures built on Vision Transformer (ViT) backbones have recently shown great potential for real-time UAV tracking. However, frequent occlusions from obstacles…
Recent smaller language models such as Phi-3.5 and Phi-4 rely on synthetic data generated using larger language models. Questions remain about…
We introduce Complex-Edit, a comprehensive benchmark designed to systematically evaluate instruction-based image editing models across instructions of varying complexity. To…