17 6 25

yang

AI & ML interests

None yet

Recent Activity

new activity 4 days ago

zai-org/GLM-5.2:We need some Air or at least some Flash

new activity 2 months ago

QuantTrio/Qwen3.6-35B-A3B-AWQ:AWQ - 8 Bit

new activity 2 months ago

Qwen/Qwen3.6-35B-A3B:Excellent SVG improve since last version!

View all activity

Organizations

None yet

New activity in zai-org/GLM-5.2 4 days ago

We need some Air or at least some Flash

❤️ 163

#3 opened 6 days ago by

jacek2024

New activity in QuantTrio/Qwen3.6-35B-A3B-AWQ 2 months ago

AWQ - 8 Bit

❤️👀 1

#3 opened 2 months ago by

shrisha

New activity in Qwen/Qwen3.6-35B-A3B 2 months ago

Excellent SVG improve since last version!

👍 2

#22 opened 2 months ago by

New activity in QuantTrio/Qwen3.6-35B-A3B-AWQ 2 months ago

Very good quality tested. On par with Qwen3.5-27b-awq. lot's of thank to QuantTrio

#2 opened 2 months ago by

New activity in QuantTrio/Qwen3.5-27B-AWQ 3 months ago

This is the best quant version in the world,better than FP8

🚀 5

#2 opened 3 months ago by

New activity in Qwen/Qwen3.5-9B 4 months ago

Can we get a 9B-FP8 version next

👍 15

#5 opened 4 months ago by

New activity in Qwen/Qwen3-Coder-Next 4 months ago

SVG improve needed

#35 opened 4 months ago by

New activity in cyankiwi/Qwen3-Coder-Next-AWQ-4bit 5 months ago

how to fix: KeyError: 'model.layers.30.mlp.shared_expert.gate_gate_up_proj.weight'

🔥 1

#1 opened 5 months ago by

New activity in Qwen/Qwen3-VL-235B-A22B-Thinking 9 months ago

How much vram is needed to run this model? 8xRTX3090=192GB isn't enough to run the context.

#12 opened 9 months ago by

FP8/4bit version please

➕ 4

#7 opened 9 months ago by

zhanghx0905

New activity in Qwen/Qwen3-Next-80B-A3B-Thinking-FP8 9 months ago

ValueError: Detected some but not all shards of model.layers.0.linear_attn.in_proj are quantized. All shards of fused layers to have the same precision.

➕ 3

#1 opened 9 months ago by

New activity in Intel/Qwen3-Next-80B-A3B-Thinking-int4-mixed-AutoRound 9 months ago

AttributeError: 'FusedMoE' object has no attribute 'moe'

#1 opened 9 months ago by

New activity in cyankiwi/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit 9 months ago

Any idea on how to fix this: KeyError: 'layers.31.mlp.shared_expert.down_proj.weight'

#1 opened 9 months ago by

New activity in zai-org/GLM-4.5-Air 10 months ago

Multiple function_tool call needed

#12 opened 10 months ago by

New activity in OPEA/gemma-3-27b-it-int4-AutoRound about 1 year ago

so consider build a model for GPU?

#1 opened over 1 year ago by

New activity in Qwen/QVQ-72B-Preview over 1 year ago

Supports function calls/structured outputs

#2 opened over 1 year ago by

luijait

New activity in kosbu/QVQ-72B-Preview-AWQ over 1 year ago

I am waiting for your release, just wait here

🚀➕ 2

#1 opened over 1 year ago by

New activity in Qwen/QVQ-72B-Preview over 1 year ago

Supports function calls/structured outputs

#2 opened over 1 year ago by

luijait

yang

AI & ML interests

Recent Activity

Organizations

kq's activity

We need some Air or at least some Flash

AWQ - 8 Bit

Excellent SVG improve since last version!

Very good quality tested. On par with Qwen3.5-27b-awq. lot's of thank to QuantTrio

This is the best quant version in the world,better than FP8

Can we get a 9B-FP8 version next

SVG improve needed

how to fix: KeyError: 'model.layers.30.mlp.shared_expert.gate_gate_up_proj.weight'

How much vram is needed to run this model? 8xRTX3090=192GB isn't enough to run the context.

FP8/4bit version please

ValueError: Detected some but not all shards of model.layers.0.linear_attn.in_proj are quantized. All shards of fused layers to have the same precision.

AttributeError: 'FusedMoE' object has no attribute 'moe'

Any idea on how to fix this: KeyError: 'layers.31.mlp.shared_expert.down_proj.weight'

Multiple function_tool call needed

so consider build a model for GPU?

Supports function calls/structured outputs

I am waiting for your release, just wait here

Supports function calls/structured outputs