yang
kq
AI & ML interests
None yet
Recent Activity
new activity 4 days ago
zai-org/GLM-5.2:We need some Air or at least some Flash new activity 2 months ago
QuantTrio/Qwen3.6-35B-A3B-AWQ:AWQ - 8 Bit new activity 2 months ago
Qwen/Qwen3.6-35B-A3B:Excellent SVG improve since last version!Organizations
None yet
We need some Air or at least some Flash
❤️ 163
39
#3 opened 6 days ago
by
jacek2024
AWQ - 8 Bit
❤️👀 1
1
#3 opened 2 months ago
by
shrisha
Excellent SVG improve since last version!
👍 2
1
#22 opened 2 months ago
by
kq
Very good quality tested. On par with Qwen3.5-27b-awq. lot's of thank to QuantTrio
3
#2 opened 2 months ago
by
kq
This is the best quant version in the world,better than FP8
🚀 5
4
#2 opened 3 months ago
by
kq
Can we get a 9B-FP8 version next
👍 15
5
#5 opened 4 months ago
by
kq
SVG improve needed
#35 opened 4 months ago
by
kq
how to fix: KeyError: 'model.layers.30.mlp.shared_expert.gate_gate_up_proj.weight'
🔥 1
2
#1 opened 5 months ago
by
kq
How much vram is needed to run this model? 8xRTX3090=192GB isn't enough to run the context.
1
#12 opened 9 months ago
by
kq
FP8/4bit version please
➕ 4
5
#7 opened 9 months ago
by
zhanghx0905
AttributeError: 'FusedMoE' object has no attribute 'moe'
2
#1 opened 9 months ago
by
kq
Any idea on how to fix this: KeyError: 'layers.31.mlp.shared_expert.down_proj.weight'
4
#1 opened 9 months ago
by
kq
Multiple function_tool call needed
1
#12 opened 10 months ago
by
kq
so consider build a model for GPU?
3
#1 opened over 1 year ago
by
kq
Supports function calls/structured outputs
4
#2 opened over 1 year ago
by
luijait
I am waiting for your release, just wait here
🚀➕ 2
3
#1 opened over 1 year ago
by
kq
Supports function calls/structured outputs
4
#2 opened over 1 year ago
by
luijait