We introduce EmbeddingGemma, a new lightweight, open text embedding model
based on the Gemma 3 language model family. Our training recipe captures
knowledge from larger models via encoder-decoder initialization and geometric
embedding distillation. We improve model
robustness and expressiveness with a spread-out regularizer, and ensure
generalizability by merging checkpoints from varied, optimized mixtures.
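To make these components concrete, the sketch below shows one plausible form of geometric embedding distillation (matching the teacher's pairwise-similarity geometry) together with a spread-out-style regularizer in the spirit of Zhang et al. (2017). The function names, loss forms, and batch setup are illustrative assumptions only, not the training code used for EmbeddingGemma.

```python
# Illustrative sketch (assumptions, not EmbeddingGemma's training code):
# distill the teacher's embedding geometry and spread student embeddings
# out over the unit hypersphere.
import numpy as np

def geometric_distillation_loss(student: np.ndarray, teacher: np.ndarray) -> float:
    """Match the student's pairwise cosine similarities to the teacher's.

    student: (batch, d_s), teacher: (batch, d_t); rows are L2-normalized,
    so the dimensions may differ while the similarity matrices align.
    """
    return float(np.mean((student @ student.T - teacher @ teacher.T) ** 2))

def spread_out_penalty(embeddings: np.ndarray) -> float:
    """Encourage embeddings of different inputs to be near-orthogonal."""
    b, d = embeddings.shape
    sims = embeddings @ embeddings.T
    off_diag = sims[~np.eye(b, dtype=bool)]   # drop self-similarities
    m1 = off_diag.mean()                      # ~0 when spread out
    m2 = (off_diag ** 2).mean()               # ~1/d when spread out
    return float(m1 ** 2 + max(0.0, m2 - 1.0 / d))

# Toy usage with random unit vectors standing in for model outputs.
rng = np.random.default_rng(0)
normalize = lambda x: x / np.linalg.norm(x, axis=1, keepdims=True)
student = normalize(rng.normal(size=(32, 256)))
teacher = normalize(rng.normal(size=(32, 1024)))
print(geometric_distillation_loss(student, teacher), spread_out_penalty(student))
```

Pushing in-batch similarities toward the statistics of uniformly distributed unit vectors (mean near zero, mean square near 1/d) is what "spreads out" the embedding space in this formulation.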
Evaluated on the Massive Text Embedding Benchmark (MTEB) across multilingual,
English, and code domains, EmbeddingGemma (300M) achieves state-of-the-art
results. Notably, among models with fewer than 500M parameters, it outperforms
the prior best, both proprietary and open, and provides performance comparable
to models double its size, offering an exceptional performance-to-cost ratio.
Remarkably, this lead persists when model weights are quantized or embedding
outputs are truncated. This makes EmbeddingGemma particularly well-suited for low-latency and
high-throughput use cases such as on-device applications. We provide ablation
studies exploring our key design choices. We release EmbeddingGemma to the
community to promote further research.
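As a concrete illustration of the truncation and quantization noted above, the sketch below shows one common way to shorten and compress an embedding for on-device use. The helper names and the Matryoshka-style truncate-then-renormalize scheme are assumptions for illustration, not the released EmbeddingGemma API.

```python
# Illustrative sketch (assumptions, not the released API): truncate an
# embedding to fewer dimensions and store it as int8 codes plus a scale.
import numpy as np

def truncate_embedding(vec: np.ndarray, dim: int) -> np.ndarray:
    """Keep the first `dim` components and re-normalize to unit length."""
    out = vec[:dim]
    return out / np.linalg.norm(out)

def quantize_int8(vec: np.ndarray) -> tuple[np.ndarray, float]:
    """Symmetric per-vector int8 quantization; returns codes and a scale."""
    scale = float(np.abs(vec).max()) / 127.0
    return np.round(vec / scale).astype(np.int8), scale

rng = np.random.default_rng(0)
full = rng.normal(size=768)
full /= np.linalg.norm(full)                # stand-in for a model output

small = truncate_embedding(full, 256)       # e.g. 768 -> 256 dimensions
codes, scale = quantize_int8(small)         # 1 byte per dimension
approx = codes.astype(np.float32) * scale   # dequantize for similarity search
print(small.shape, codes.dtype, float(approx @ small))
```

Keeping only the leading dimensions and storing one-byte codes shrinks index size and memory traffic, which is what makes the low-latency, on-device setting practical.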