what/llm/can/i/run
models/01-ai-yi-1.5-34b-chat

YYi 1.5 34B

Sources arena · Confidence C
params
34B
dense
bench sources
1
single
context
max tokens
license
Apache-2.0
permissive

Model meta

canonical
01-ai/Yi-1.5-34B-Chat
parameters
34B
organization
01 AI
license
Apache-2.0
context
downloads

Benchmark breakdown

Hardware fit matrix

by gpu tierQ2_K
12.8 GB
Q4_K_M
17.0 GB
Q5_K_M
21.3 GB
Q6_K
25.5 GB
Q8_0
34.0 GB
FP16
68.0 GB
G8 GB tier
rtx 3050 · 4060
G12 GB tier
rtx 3060 · 4070
G16 GB tier
rtx 4080 · 4060 ti 16g
OK
43 t/s
G24 GB tier
rtx 3090 · 4090
GREAT
70 t/s
GREAT
53 t/s
TIGHT
43 t/s
G32 GB tier
rtx 5090 · m3 max
GREAT
130 t/s
GREAT
98 t/s
GREAT
79 t/s
OK
66 t/s
G48 GB tier
a6000 · m3 max 64
GREAT
42 t/s
GREAT
32 t/s
GREAT
25 t/s
GREAT
21 t/s
GREAT
16 t/s
G80 GB tier
h100 · m3 ultra 128
GREAT
142 t/s
GREAT
107 t/s
GREAT
86 t/s
GREAT
72 t/s
GREAT
54 t/s
OK
27 t/s
full fit · production speed tight fit · usabledoesn't fit
Get personalized ranking
Tell us your machine — we'll tell you if this is actually your best pick, or what's better.
Rank for my hardware