Unlike its US competitor, DeepSeek-R1 can be accessed for free.
DeepSeek-R1 seems to excel at math, word problems and programming.
It's available via a free chat interface and a low-cost API.
A "distilled" version of the open-source model can run on a laptop.
Chinese AI company DeepSeek has launched a "reasoning" model that beats OpenAI's flagship o1 on certain benchmarks.
Available through a low-cost API and a free chat interface, DeepSeek-R1 appears to outperform its US rival on certain math, word problem and programming tests.
Smaller, "distilled" versions of the model are also available, including one that can run on a laptop. They're all open source and distributed with an MIT license, enabling commercial use by indie hackers.
Like other reasoning models, DeepSeek-R1 spends longer "thinking" through its responses than traditional large language models, which tends to produce better answers to complex questions. It can also check its own reasoning as it goes, reducing the chance of hallucinations.
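For developers curious what that "thinking" looks like in practice, DeepSeek documents an OpenAI-compatible API that returns the reasoning trace alongside the final answer. The sketch below assumes the OpenAI Python SDK; the model name `deepseek-reasoner` and the `reasoning_content` field are based on DeepSeek's public documentation at the time of writing, so treat them as assumptions and check the docs before relying on them.

```python
# Hedged sketch: inspect DeepSeek-R1's separate "thinking" output via its API.
# Assumptions: the OpenAI Python SDK, DeepSeek's OpenAI-compatible endpoint,
# the model name "deepseek-reasoner" and the "reasoning_content" field.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPSEEK_API_KEY",       # placeholder key
    base_url="https://api.deepseek.com",   # DeepSeek's OpenAI-compatible endpoint
)

response = client.chat.completions.create(
    model="deepseek-reasoner",
    messages=[{"role": "user", "content": "If a train leaves at 3pm at 80 km/h, when has it covered 200 km?"}],
)

message = response.choices[0].message
print("Reasoning trace:\n", message.reasoning_content)  # the intermediate "thinking"
print("Final answer:\n", message.content)               # the answer it settles on
```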
Containing some 671 billion parameters, DeepSeek-R1 builds on a preview version released in November 2024. A parameter count that large should translate into serious problem-solving ability, and the company's reported benchmark results suggest it does.
The model outperforms the latest available version of o1 on the AIME 2024, MATH-500 and SWE-bench Verified benchmarks, according to data shared by the company on Monday.
These are designed to assess capability in math, word problems and programming, respectively.
But DeepSeek-R1 falls short of o1 on the Codeforces, GPQA Diamond and MMLU benchmarks, which measure competitive coding, scientific reasoning and broader academic knowledge respectively.
Smaller "distilled" versions of the model are also available, including one that can run on a laptop. This will be particularly exciting for indie hackers who want to experiment deeply with AI models (or simply cut costs).
You can see how these smaller models perform compared to other models from OpenAI, Anthropic and Alibaba here:
DeepSeek-R1 and its "distilled" models are freely available via Hugging Face.
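As a rough sketch of what grabbing one of those checkpoints looks like, the snippet below uses the huggingface_hub library. The repo ID shown is an assumption based on DeepSeek's Hugging Face organisation, so check the hub for the exact names and sizes on offer.

```python
# Hedged sketch: download one of the distilled checkpoints from Hugging Face.
# The repo ID is an assumption; browse the deepseek-ai organisation on the hub
# for the exact variants available.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(
    repo_id="deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",  # assumed smallest distilled variant
)
print("Model files downloaded to:", local_dir)
```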
As DeepSeek is based in China, its models have to abide by the country's censorship rules and espouse "core socialist values." This means they have major blind spots when it comes to unflattering questions about China's government and history.
DeepSeek-R1 tends to duck such queries, responding with "Sorry, I'm not sure how to approach this type of question yet."
DeepSeek-R1's extremely competitive prices might be enough for users to overlook those blind spots. It's currently far cheaper to use than OpenAI's o1 however you access it. Users can chat with the model for free for up to 50 messages a day, while developers can build with its API for less than a dollar per million input tokens:
DeepSeek-R1: $0.55 per million input tokens, $0.14 per million cached input tokens and $2.19 per million output tokens
OpenAI o1: $15 per million input tokens, $7.50 per million cached input tokens and $60 per million output tokens
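To put those list prices in perspective, here's a back-of-the-envelope comparison for a hypothetical monthly workload. The token counts are invented purely for illustration, and cached-input discounts are ignored.

```python
# Back-of-the-envelope API cost comparison using the list prices above.
# The monthly token counts are hypothetical, purely for illustration,
# and cached-input discounts are ignored.
PRICES_PER_MILLION = {
    "DeepSeek-R1": {"input": 0.55, "output": 2.19},
    "OpenAI o1":   {"input": 15.00, "output": 60.00},
}

input_tokens = 10_000_000   # assumed 10M input tokens per month
output_tokens = 2_000_000   # assumed 2M output tokens per month

for model, price in PRICES_PER_MILLION.items():
    cost = (input_tokens / 1e6) * price["input"] + (output_tokens / 1e6) * price["output"]
    print(f"{model}: ${cost:,.2f}/month")

# DeepSeek-R1: $9.88/month
# OpenAI o1: $270.00/month
```

The exact gap depends on your input/output mix and cache hit rate, but at these list prices it stays well over an order of magnitude.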
Although the chat interface and API are the only feasible ways to access DeepSeek-R1 itself, those with the right hardware can try out the smallest distilled version on their own computer, potentially cutting out API fees altogether.
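As a minimal local-inference sketch, assuming the Hugging Face transformers library (a recent release that accepts chat-style input), the accelerate package, and hardware that can hold the smallest distilled checkpoint, this is roughly what cutting out API fees looks like. The model ID is an assumption, so swap in whichever distilled size you actually downloaded.

```python
# Hedged sketch: run the smallest distilled checkpoint locally with transformers.
# Assumptions: a recent transformers release that accepts chat-style input,
# the accelerate package for device_map="auto", and the assumed model ID below.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",  # assumed ID; use whichever size you downloaded
    device_map="auto",                                  # falls back to CPU if no GPU is available
)

messages = [{"role": "user", "content": "What is 17 * 24? Think step by step."}]
result = generator(messages, max_new_tokens=512)
print(result[0]["generated_text"][-1]["content"])       # the assistant's reply, reasoning included
```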
The next few weeks should have plenty in store for indie hackers interested in reasoning models. Alibaba announced a preview of its own reasoning model, QwQ, just days after DeepSeek offered a first glimpse of R1 in November. A fuller version of QwQ may not be far behind.
And of course, OpenAI is weeks away from dropping a preview version of o3 — a second-generation reasoning model it sees as key to its efforts to reach the holy grail of artificial general intelligence.
I’ve actually used DeepSeek’s reasoning model in a scalability project, and it’s impressive how efficiently it handles complex reasoning tasks. It made prototyping intelligent features much faster and lighter on resources.
Already using DeepSeek on my newer products. At 1/10th the cost, it's a no-brainer.
Let’s see which one will keep up in the long term, beyond trends … 🙀
DeepSeek's release of its reasoning model is a bold move in the AI field, aiming to rival OpenAI's o1. This could bring fresh advancements in AI's ability to reason and understand complex data.
Awesome to see more competition in the AI space! What makes DeepSeek’s reasoning model stand out compared to OpenAI’s? Curious to hear more about its unique strengths
DeepSeek is the GOAT of AI right now. As a software engineer I was using ChatGPT, then I saw some dude's tweet on X recommending DeepSeek. Hell, he was right: so much better performance and output! Hope they keep competing with OpenAI =)
DeepSeek-R1 offers an impressive blend of advanced reasoning capabilities, accessibility, and affordability, making it a strong contender in the AI space.
Been using DeepSeek for a while now, really great and pretty much on par with all the leading-edge stuff. R1's great as well, it just doesn't have the largest of context windows.