Fractal Releases Fathom-R1-14B Reasoning Model On DeepSeek For $499

Fractal, Mumbai based AI company, has launched a new open-source large language model, Fathom-R1-14B. The model delivers mathematical reasoning performance that surpasses o1-mini and o3-mini, and approaches o4-mini levels, all at a post-training cost of just $499.

The model is available to try on Hugging Face, and the codebase is on GitHub. It is available under the MIT license, along with datasets and training recipes.

Developed as part of a proposed initiative to build India’s first large reasoning model under the IndiaAI mission, Fathom-R1-14B is a 14-billion-parameter model derived from Deepseek-R1-Distilled-Qwen-14B.

“We proposed building India’s first large reasoning model as part of the IndiaAI mission. We proposed building three models (a small one, a mid-sized one and a large one with 70 billion parameters),” said Fractal CEO Srikanth Velamakanni in a LinkedIn post.

He further added that “This is just a tiny proof of what’s possible.”

On olympiad-level exams AIME-25 and HMMT-25, Fathom-R1-14B achieves 52.71% and 35.26% Pass@1 accuracy, respectively. When allowed additional inference-time compute (cons@64), the scores rise to 76.7% and 56.7%.

“It delivers performance rivalling closed-source o4-mini (low) with respect to cons@64 ,all while staying within a 16K context window,” the company said.

The model was post-trained using supervised fine-tuning (SFT), curriculum learning, and model merging.

“We perform supervised fine-tuning on carefully curated datasets using a specific training approach, followed by model merging,” the company said.

Fractal has also introduced a separate variant, Fathom-R1-14B-RS, achieved similar results using a combination of reinforcement learning and SFT, costing $967.

Last year, the company launched Vaidya.ai, a multi-modal AI platform designed to offer free and accessible healthcare assistance. Meanwhile, Sarvam, the startup selected for building India’s foundational LLM under the IndiaAI Mission recently unveiled Sarvam-M, a 24-billion parameter open-weights hybrid language model built on top of Mistral Small.

Source link

What's Hot

Hybrid Reinforcement: When Reward Is Sparse, It’s Better to Be Dense – Takara TLDR

GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations – Takara TLDR

OpenAI Will Stop Saving Users’ Deleted Posts

Fractal Releases Fathom-R1-14B Reasoning Model on DeepSeek for $499

When You Tell AI Models to Act Like Women, Most Become More Risk-Averse: Study

Ant Group Launches Ling-1T: China’s Trillion-Parameter AI Model to Rival OpenAI and DeepSeek

New York-Based Reflection AI Raises $2B, Hits $8B Valuation

Smithsonian Closes Museums Amid Government Shutdown

The Rubin Names 2025 Art Prize, Research and Art Projects Grants

Kochi-Muziris Biennial Announces 66 Artists for December Exhibition

Instagram Launches ‘Rings’ Awards for Creators—With KAWS as a Judge

Hybrid Reinforcement: When Reward Is Sparse, It’s Better to Be Dense – Takara TLDR

GyroSwin: 5D Surrogates for Gyrokinetic Plasma Turbulence Simulations – Takara TLDR

OpenAI Will Stop Saving Users’ Deleted Posts

What's Hot

Fractal Releases Fathom-R1-14B Reasoning Model on DeepSeek for $499

Related Posts

Subscribe to Updates