Fractal Releases Fathom-R1-14B Reasoning Model On DeepSeek For $499

Fractal, Mumbai based AI company, has launched a new open-source large language model, Fathom-R1-14B. The model delivers mathematical reasoning performance that surpasses o1-mini and o3-mini, and approaches o4-mini levels, all at a post-training cost of just $499.

The model is available to try on Hugging Face, and the codebase is on GitHub. It is available under the MIT license, along with datasets and training recipes.

Developed as part of a proposed initiative to build India’s first large reasoning model under the IndiaAI mission, Fathom-R1-14B is a 14-billion-parameter model derived from Deepseek-R1-Distilled-Qwen-14B.

“We proposed building India’s first large reasoning model as part of the IndiaAI mission. We proposed building three models (a small one, a mid-sized one and a large one with 70 billion parameters),” said Fractal CEO Srikanth Velamakanni in a LinkedIn post.

He further added that “This is just a tiny proof of what’s possible.”

On olympiad-level exams AIME-25 and HMMT-25, Fathom-R1-14B achieves 52.71% and 35.26% Pass@1 accuracy, respectively. When allowed additional inference-time compute (cons@64), the scores rise to 76.7% and 56.7%.

“It delivers performance rivalling closed-source o4-mini (low) with respect to cons@64 ,all while staying within a 16K context window,” the company said.

The model was post-trained using supervised fine-tuning (SFT), curriculum learning, and model merging.

“We perform supervised fine-tuning on carefully curated datasets using a specific training approach, followed by model merging,” the company said.

Fractal has also introduced a separate variant, Fathom-R1-14B-RS, achieved similar results using a combination of reinforcement learning and SFT, costing $967.

Last year, the company launched Vaidya.ai, a multi-modal AI platform designed to offer free and accessible healthcare assistance. Meanwhile, Sarvam, the startup selected for building India’s foundational LLM under the IndiaAI Mission recently unveiled Sarvam-M, a 24-billion parameter open-weights hybrid language model built on top of Mistral Small.

Source link

What's Hot

Amazon Music’s new AI feature generates personalized playlists every Monday

AI Made Her a Better Mom. so She Vibe-Coded a Web App for Others.

Sam Altman says that bots are making social media feel ‘fake’

Fractal Releases Fathom-R1-14B Reasoning Model on DeepSeek for $499

Alibaba Stock Climbs Over 3% In Monday Pre-Market: What’s Going On? – Alibaba Gr Hldgs (NYSE:BABA), Alphabet (NASDAQ:GOOG)

Alibaba unveils its largest AI model with 1 trillion parameters

Alibaba shares rose after release if its biggest AI model yet

Storied Collector and MoMA Trustee Dies at 92

Congress Obtains Drawing Trump Apparently Made for Jeffrey Epstein

New Banksy Work at London’s Royal Courts Immediately Covered Up

John Pritzker Donates 188 Dada and Surrealist Works to the Met Museum

Amazon Music’s new AI feature generates personalized playlists every Monday

AI Made Her a Better Mom. so She Vibe-Coded a Web App for Others.

Sam Altman says that bots are making social media feel ‘fake’

What's Hot

Fractal Releases Fathom-R1-14B Reasoning Model on DeepSeek for $499

Related Posts

Subscribe to Updates