Shihan Qu
zenmagnets
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
brandonmusic/GLM-5.2-NVFP4-REAP-Recall-N172 liked a model 5 days ago
madeby561/GLM-5.2-NVFP4-REAP-504B liked a model 10 days ago
brandonmusic/MiniMax-M3-NVFP4Organizations
None yet
Qwen3.6 397b
3
#75 opened 2 months ago
by
zenmagnets
31gb NVFP4 Model?
4
#1 opened 2 months ago
by
zenmagnets
license
👀👍 9
16
#5 opened 2 months ago
by
festr2
Pending GPU & vLLM validation
3
#1 opened 3 months ago
by
nwzjk
No commercial use allowed in License?
👀😔 4
10
#6 opened 2 months ago
by
zenmagnets
How to run on vLLM for 4xSM120
#1 opened 4 months ago
by
zenmagnets
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍 3
17
#1 opened 4 months ago
by
zenmagnets
Anyone get this working on 4x RTX 6000 Pro?
👀 2
5
#1 opened 4 months ago
by
zenmagnets
Throughput NVFP4 on Dual 6000 Blackwells
#2 opened 4 months ago
by
zenmagnets
Anyone try this on 4x RTX 6000 Pro yet?
52
#1 opened 4 months ago
by
zenmagnets
I wish it would fit in 2x6000 PRO!
1
#2 opened 4 months ago
by
mtcl
"w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected."
👍 1
21
#2 opened 4 months ago
by
zenmagnets
Wasn't able to recreate MMLU-Pro benchmarks
5
#5 opened 5 months ago
by
zenmagnets
Enormous KV-cache size?
👍➕ 6
23
#3 opened 5 months ago
by
nephepritou
Really appreciate that you ran performance comparison tests with BF16!
3
#2 opened 5 months ago
by
zenmagnets
Performance comps with BF16?
1
#3 opened 5 months ago
by
zenmagnets
Any plans for a 6bit or 8bit version?
1
#3 opened 5 months ago
by
zenmagnets
If 8bit, why shaped like 16 bit
2
#2 opened 5 months ago
by
zenmagnets
6 months since intro of NVFP4, and it's basically still a myth
1
#4 opened 7 months ago
by
zenmagnets
Works with vllm? Any recommendations or howtos?
7
#1 opened 8 months ago
by
DrRos