
DeepSeek R1 Distilled Models in Ollama: Not What You Think

DeepSeek R1’s distilled models in Ollama sound like smaller versions of the original, but are they really?

Kshitij Darwhekar
Towards AI
5 min read · Jan 30, 2025
AI-generated using ChatGPT by Author


Introduction

DeepSeek recently introduced its models, including DeepSeek V3 and DeepSeek R1. These models have gained significant popularity in the AI community and on social media due to their impressive performance compared to models like OpenAI’s o1. Unlike OpenAI’s models, DeepSeek’s models are fully open-source and free to use.

DeepSeek Models in comparison with OpenAI’s models.

Since DeepSeek models are open-source and licensed under MIT, they are free to use for both personal and commercial purposes, and you can even run them locally. However, unless you have an insanely powerful machine, you won’t be able to run DeepSeek R1 on your local setup.

That’s where the smaller distilled models come in. DeepSeek has released not only the full 671B-parameter R1 model but also several smaller distilled checkpoints built on dense models that are widely used in the research community.

All of these models are available on Hugging Face and Ollama, so you can choose whichever platform you prefer. In the next section, we’ll dive deeper into these distilled models and their performance.

What Are DeepSeek R1 Distilled Models?

In simple terms, distillation is the process of transferring knowledge from a larger model (the teacher) to a smaller model (the student) while losing as little performance as possible. In other words, the student model learns to mimic the larger teacher model, either at the final prediction layer or in the underlying hidden layers.
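To make the idea concrete, here is a minimal sketch of classic knowledge distillation at the prediction layer, where the student is trained against the teacher’s temperature-softened output distribution. This is a generic illustration of the technique, not DeepSeek’s exact recipe (as described below, their distilled models are produced by supervised fine-tuning on reasoning data generated by R1):

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft-target term: the student mimics the teacher's temperature-softened distribution.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale to compensate for the temperature scaling
    # Hard-target term: ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```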

Using reasoning data generated by DeepSeek-R1, DeepSeek fine-tuned several dense models that are widely used in the research community. The evaluation results demonstrate that these distilled smaller dense models perform exceptionally well on benchmarks. DeepSeek has open-sourced distilled 1.5B, 7B, 8B, 14B, 32B, and 70B checkpoints based on the Qwen2.5 and Llama3 series.
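Because these checkpoints are ordinary Qwen/Llama causal language models, you can load them from Hugging Face like any other model. Here is a minimal sketch, assuming the deepseek-ai/DeepSeek-R1-Distill-Qwen-7B checkpoint, a machine with enough memory, and transformers plus accelerate installed:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B"  # swap in the 1.5B/8B/14B/32B/70B variants as needed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# The distilled models emit their chain of thought inside <think>...</think> tags before the final answer.
messages = [{"role": "user", "content": "How many r's are in the word strawberry?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```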

Note: You can refer to the original GitHub repo for more details.

Running DeepSeek R1 Locally?

A lot of people on the internet claim that DeepSeek R1 is an open-source model that can be run locally. While this is true, the computing power required to run the full model is substantial.
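As a rough back-of-the-envelope estimate (my own arithmetic, not an official figure), just holding the full model’s weights in memory, assuming its native FP8 precision, already calls for server-class hardware:

```python
params = 671e9        # full DeepSeek R1 parameter count
bytes_per_param = 1   # assuming FP8 weights (1 byte per parameter); FP16 would double this
weights_gb = params * bytes_per_param / 1e9
print(f"~{weights_gb:.0f} GB for weights alone, before KV cache and activations")  # ~671 GB
```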

In most cases, those claiming to run the model locally are actually using distilled versions. They believe they are running the smaller versions of the original DeepSeek model, but in reality, they are running either Qwen or Llama models that have been distilled with the reasoning capabilities of DeepSeek R1.

Why the Naming Might Be Confusing

Ollama seems to be a popular choice for running this model locally, and to be honest, I use it too because of its ease of use. You only need to run a single command, and voilà—the model is ready to use on your local machine.

However, one concern I have with Ollama is how the models are named on its website. For example, to run the DeepSeek-R1 distilled Qwen 7B model, you type the following command:

ollama run deepseek-r1
DeepSeek R1 models on Ollama

This can confuse a lot of non-technical, and even some technical, users: it sounds like these are just smaller versions of the original DeepSeek R1 model, when in reality they are Qwen and Llama models fine-tuned with DeepSeek R1’s reasoning capability.
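If you want to be explicit about which variant you are pulling, the size-tagged names make the distinction visible. Here is a small sketch using the official Ollama Python client (pip install ollama), assuming a local Ollama server is running; deepseek-r1:7b is the distilled Qwen 7B model, and at the time of writing the bare deepseek-r1 tag resolves to the same distilled default rather than to the full 671B model:

```python
import ollama

# Size-tagged variants (1.5b, 7b, 8b, 14b, 32b, 70b) are the Qwen/Llama distills;
# only the 671b tag is the full DeepSeek R1 model.
response = ollama.chat(
    model="deepseek-r1:7b",
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
)
print(response["message"]["content"])
```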

Is This a Bad Thing?

Not necessarily. At the end of the day, these are the official models launched by DeepSeek, and they perform as expected when compared to similar models—sometimes even outperforming them.

Distilled Model Evaluation

As you can see in the evaluation table above, the DeepSeek team reports:

Simply distilling DeepSeek-R1’s outputs enables the efficient DeepSeek-R1-Distill-Qwen-7B to outperform non-reasoning models like GPT-4o-0513 across the board. DeepSeek-R1-14B surpasses QwQ-32B-Preview on all evaluation metrics, while DeepSeek-R1-32B and DeepSeek-R1-70B significantly exceed o1-mini on most benchmarks. These results demonstrate the strong potential of distillation. Additionally, we found that applying RL to these distilled models yields significant further gains. We believe this warrants further exploration and therefore present only the results of the simple SFT-distilled models here.

If these models perform so well with the reasoning capabilities of DeepSeek-R1, why did I write this article?

The reason is simple: people running the distilled models locally, rather than using the web UI (or the application), should understand the difference between these models and the full R1 so they can set realistic performance expectations.

Final Thoughts: What Should Users Know?

As we saw, these distilled models outperform their competition, and that is a big deal: you can run models with this much capability on a local machine with only a modest drop in performance. It is also true, however, that you will get better results if you use the full model through the web UI or the official application. If you have privacy and security concerns, the Perplexity Pro version lets you use a DeepSeek R1 model hosted in the USA.

But if you are like me, think privacy in today’s world is a myth, don’t fully trust any of these companies, and still want to use the models, the distilled models offer the best deal.

Note: You can explore the model in detail here

If you want to support me, you can Buy Me a Coffee.

