what/llm/can/i/run
models/google-gemma-3-4b-it

Gemma 3 4B

Sources arena, livebench · Confidence B
params
4B
dense
bench sources
2
mid conf
context
max tokens
license
Gemma
see source

Model meta

canonical
google/gemma-3-4b-it
parameters
4B
organization
Google
license
Gemma
context
downloads

Benchmark breakdown

LiveBench
language15.1
overall15.1

Hardware fit matrix

by gpu tierQ2_K
1.5 GB
Q4_K_M
2.0 GB
Q5_K_M
2.5 GB
Q6_K
3.0 GB
Q8_0
4.0 GB
FP16
8.0 GB
G8 GB tier
rtx 3050 · 4060
GREAT
130 t/s
GREAT
104 t/s
GREAT
86 t/s
GREAT
74 t/s
GREAT
57 t/s
G12 GB tier
rtx 3060 · 4070
GREAT
193 t/s
GREAT
154 t/s
GREAT
128 t/s
GREAT
110 t/s
GREAT
85 t/s
GREAT
45 t/s
G16 GB tier
rtx 4080 · 4060 ti 16g
GREAT
289 t/s
GREAT
231 t/s
GREAT
192 t/s
GREAT
164 t/s
GREAT
128 t/s
GREAT
67 t/s
G24 GB tier
rtx 3090 · 4090
GREAT
467 t/s
GREAT
373 t/s
GREAT
310 t/s
GREAT
266 t/s
GREAT
206 t/s
GREAT
109 t/s
G32 GB tier
rtx 5090 · m3 max
GREAT
867 t/s
GREAT
692 t/s
GREAT
576 t/s
GREAT
493 t/s
GREAT
383 t/s
GREAT
202 t/s
G48 GB tier
a6000 · m3 max 64
GREAT
280 t/s
GREAT
223 t/s
GREAT
186 t/s
GREAT
159 t/s
GREAT
123 t/s
GREAT
65 t/s
G80 GB tier
h100 · m3 ultra 128
GREAT
949 t/s
GREAT
758 t/s
GREAT
630 t/s
GREAT
540 t/s
GREAT
419 t/s
GREAT
221 t/s
full fit · production speed tight fit · usabledoesn't fit
Get personalized ranking
Tell us your machine — we'll tell you if this is actually your best pick, or what's better.
Rank for my hardware