Active filters: ollama
rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled-GGUF
Text Generation
• 27B • Updated • 35.8k
• 26
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated • 187
recursivecurse/qwen2.5-coder-3b-vitest.Q4_K_M.gguf
Text Generation
• 3B • Updated • 77
• 1
continuum-ai/qwen3.5-4b-code-forged-GGUF
Text Generation
• 4B • Updated • 4.69k
• 6
deadbydawn101/gemma-4-E4B-Agentic-Opus-Reasoning-GeminiCLI-GGUF
Text Generation
• 8B • Updated • 3.07k
• 3
batiai/Qwen3.6-35B-A3B-GGUF
Text Generation
• 35B • Updated • 11k
• 2
nimendraai/NuExtract-tiny-Resume-Data-Extractor
Text Generation
• 0.5B • Updated • 824
• 4
mahmoudalyosify/Horus-OSINT
Text Generation
• 8B • Updated • 116
• 1
Image-Text-to-Text
• 2B • Updated • 224
• 1
B4lt/gemma4-loghub-e2b-GGUF
Text Generation
• 5B • Updated • 235
• 1
Text Generation
• 2B • Updated • 304
• 2
pacozaa/mistral-unsloth-chatml-first
4B • Updated • 116
pacozaa/tinyllama-alpaca-lora
7B • Updated • 36
pacozaa/TinyLlama-1.1B-intermediate-step-1431k-3T-GGUF
1B • Updated • 44
pacozaa/mistral-sharegpt90k
pacozaa/mistral-sharegpt90k-merged_16bit
Text Generation
• 7B • Updated • 13
TrabEsrever/dolphin-2.9-llama3-70b-GGUF
Updated
daekeun-ml/Phi-3-medium-4k-instruct-ko-poc-gguf-v0.1
Text Generation
• 14B • Updated • 55
• 1
hierholzer/Llama-3.1-70B-Instruct-GGUF
Text Generation
• 71B • Updated • 1.54k
• 3
LucasInsight/Meta-Llama-3.1-8B-Instruct
8B • Updated • 117
• 1
LucasInsight/Meta-Llama-3-8B-Instruct
8B • Updated • 122
Shyamnath/Llama-3.2-3b-Uncensored-GGUF
Text Generation
• 4B • Updated • 283
• 4
ghost-x/ghost-8b-beta-1608-gguf
Text Generation
• 8B • Updated • 388
• 6
cahaj/Phi-3.5-mini-instruct-text2sql-GGUF
4B • Updated • 70
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python_Spanish_English_16bit
0.5B • Updated
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-extra_small_quantization_GGUF_3bit
0.5B • Updated
Agnuxo/Tinytron-Qwen-0.5B-Instruct_CODE_Python-Spanish_English_GGUF_4bit
0.5B • Updated
Agnuxo/Tinytron-Qwen-0.5B-TinyLlama-Instruct_CODE_Python-Spanish_English_GGUF_q5_k
0.5B • Updated