Question 1

How much VRAM does DeepSeek R1 671B need?

Accepted Answer

The full DeepSeek R1 is a 671B Mixture of Experts model. At INT4, it needs ~335 GB VRAM — requiring 8× A100 80GB or 5× H100 80GB. In practice, most users run the distilled variants (7B–70B) which offer strong reasoning on consumer hardware.

Question 2

What are the DeepSeek R1 distilled models?

Accepted Answer

DeepSeek released smaller distilled versions trained from R1: R1-Distill-Qwen-1.5B, 7B, 14B, 32B and R1-Distill-Llama-8B, 70B. The 7B distill runs at GGUF Q4 on any 8GB GPU. The 70B distill runs at INT4 on an A100 40GB, with reasoning quality close to the full 671B model.

Question 3

Is DeepSeek R1 better than GPT-4 for reasoning?

Accepted Answer

DeepSeek R1 matches or exceeds GPT-4o on AIME 2024, Codeforces, and MATH benchmarks — at a fraction of the training cost. For open-source local deployment, R1-Distill-70B-INT4 is the strongest reasoning model available below $2/hr cloud cost.

Question 4

How do I run DeepSeek R1 locally?

Accepted Answer

Use Ollama: `ollama run deepseek-r1:7b` (for the 7B distill, ~4.5GB VRAM) or `ollama run deepseek-r1:70b` (for 70B distill at Q4, ~40GB VRAM). For the full 671B model you need a multi-GPU cluster — use vLLM with tensor parallelism.

Model	Params	VRAM (INT4)	Minimum GPU
R1 full	671B MoE	~335 GB	8× A100 80GB
R1-Distill-70B	70B dense	~38 GB	A100 40GB
R1-Distill-32B	32B dense	~18 GB	RTX 4090 24GB (tight)
R1-Distill-14B	14B dense	~8 GB	RTX 3070 8GB
R1-Distill-7B	7B dense	~4.5 GB	Any 6GB+ GPU
R1-Distill-1.5B	1.5B dense	~1 GB	CPU-only feasible

DeepSeek R1 VRAM Calculator

DeepSeek R1: Full Model vs Distilled Variants

VRAM by Model Size at INT4

Why the Distilled Models Are Remarkable

MoE Memory Note

Frequently Asked Questions

How much VRAM does DeepSeek R1 671B need?

What are the DeepSeek R1 distilled models?

Is DeepSeek R1 better than GPT-4 for reasoning?

How do I run DeepSeek R1 locally?

Related Tools

Related Guides

How Much VRAM Do You Need to Run LLMs? A Practical Guide