What Token Generation are you guys getting with CPU only?
#3 opened 26 days ago
by
zenmagnets
Qwen/Qwen3-Next-80B-A3B-Thinking has MMLU_PRO 82.7 but you guys get 0.7271
2
#2 opened 3 months ago
by
hlxxxxxx
Difference between int4-mixed and int4
1
#1 opened 3 months ago
by
whoisjeremylam