Inference Providers
Active filters: vLLM
mistralai/Mistral-Medium-3.5-128B
128B • Updated • 256k
• 337
mistralai/Mistral-Small-4-119B-2603
119B • Updated • 62.8k
• 379
unsloth/Mistral-Small-4-119B-2603-GGUF
119B • Updated • 13.7k
• 69
QuantTrio/Qwen3.6-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 845k
• 24
mistralai/Mistral-Small-4-119B-2603-eagle
Updated • 184
• 52
mradermacher/Mistral-Small-4-119B-2603-i1-GGUF
119B • Updated • 2.03k
• 1
QuantTrio/Qwen3.6-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 886k
• 11
QuantTrio/Qwen3.6-27B-AWQ-6Bit
Image-Text-to-Text
• 28B • Updated • 31.5k
• 9
RecViking/Mistral-Medium-3.5-128B-NVFP4
74B • Updated • 10.7k
• 5
cyankiwi/Mistral-Medium-3.5-128B-AWQ-INT4
25B • Updated • 17.2k
• 2
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 516k
• 8
Image-Text-to-Text
• 10B • Updated • 241k
• 17
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 28
• 6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
• 9B • Updated • 9
• 2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 57
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 59
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
• 15B • Updated • 6
• 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B • Updated • 191
• 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B • Updated • 460
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
• 0.6B • Updated • 224
• 1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
• 0.6B • Updated • 21
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 2.7k
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 25
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 26.8k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 442
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 23
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 90
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 121k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 811
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 559
• 4