
DeepSeek R1

Sources: arena, livebench · Confidence: B
params: 671B (MoE total)
bench sources: 2 (mid conf)
context (max tokens):
license: MIT (permissive)

Model meta

canonical: deepseek-ai/DeepSeek-R1
parameters: 671B
organization: DeepSeek
license: MIT
context:
downloads:

Benchmark breakdown

LiveBench
instruction following: 80.6
coding: 70.3
language: 49.4
overall: 66.8

Hardware fit matrix

Quantization    Size
Q2_K            251.6 GB
Q4_K_M          335.5 GB
Q5_K_M          419.4 GB
Q6_K            503.3 GB
Q8_0            671.0 GB
FP16            1342.0 GB

GPU tier        Example hardware
8 GB tier       rtx 3050 · 4060
12 GB tier      rtx 3060 · 4070
16 GB tier      rtx 4080 · 4060 ti 16g
24 GB tier      rtx 3090 · 4090
32 GB tier      rtx 5090 · m3 max
48 GB tier      a6000 · m3 max 64
80 GB tier      h100 · m3 ultra 128

Legend: full fit (production speed) · tight fit (usable) · doesn't fit
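
The sizes in the matrix appear to follow a simple rule: parameter count times a nominal number of bits per weight, divided by 8. The sketch below reproduces the listed figures under that assumption. The nominal bit widths (3, 4, 5, 6, 8, 16) are inferred from the numbers on this page and are not stated by it; actual quantized files may differ. The helper names `weights_size_gb` and `fits` are hypothetical, and the fit check counts weights only, on a single device, ignoring KV cache and runtime overhead.

```python
# Total MoE parameter count in billions, as listed on the page.
PARAMS_BILLIONS = 671

# Nominal bits per weight per quantization level.
# ASSUMPTION: inferred from the sizes shown in the matrix, not stated by the page.
NOMINAL_BITS = {
    "Q2_K": 3,
    "Q4_K_M": 4,
    "Q5_K_M": 5,
    "Q6_K": 6,
    "Q8_0": 8,
    "FP16": 16,
}

# GPU memory tiers from the matrix, in GB.
GPU_TIERS_GB = [8, 12, 16, 24, 32, 48, 80]


def weights_size_gb(params_billions, bits_per_weight):
    """Approximate weight footprint in GB (1 GB = 1e9 bytes)."""
    return params_billions * bits_per_weight / 8


def fits(size_gb, memory_gb):
    """Weights-only check: ignores KV cache, activations, and overhead."""
    return size_gb <= memory_gb


if __name__ == "__main__":
    for quant, bits in NOMINAL_BITS.items():
        size = weights_size_gb(PARAMS_BILLIONS, bits)
        fitting = [tier for tier in GPU_TIERS_GB if fits(size, tier)]
        print(f"{quant:7s} {size:8.1f} GB  fits tiers: {fitting or 'none'}")
```

Under these assumptions, even the smallest listed quantization (Q2_K at 251.6 GB) exceeds the 80 GB tier, so none of the listed tiers would hold the full weights on a single device.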