Inference Providers
Active filters: llama-3
NousResearch/Meta-Llama-3.1-8B-Instruct
Text Generation
• 8B • Updated • 224k
• • 41
Text Generation
• 3B • Updated • 965k
• 754
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
• 11B • Updated • 12.8k
• 586
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
• 89B • Updated • 2.61k
• 134
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
• 1B • Updated • 34.6k
• 20
mlx-community/Llama-3.2-3B-Instruct-4bit
Text Generation
• 0.5B • Updated • 43.4k
• 43
lmstudio-community/Llama-3.2-1B-Instruct-GGUF
Text Generation
• 1B • Updated • 10.2k
• 47
unsloth/Llama-3.2-1B-Instruct
Text Generation
• 1B • Updated • 143k
• 92
unsloth/Llama-3.2-3B-Instruct-bnb-4bit
Text Generation
• 3B • Updated • 50.9k
• 34
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
• 11B • Updated • 7.94k
• 81
Text Generation
• 7B • Updated • 638
• 27
tjake/Llama-3.2-1B-Instruct-JQ4
Text Generation
• Updated • 1.36k
• 4
bartowski/Llama-3.3-70B-Instruct-GGUF
Text Generation
• 71B • Updated • 12.9k
• 73
mradermacher/Llama-3.3-70B-Instruct-abliterated-GGUF
71B • Updated • 675
• 5
DavidAU/L3-MOE-4x8B-Dark-Planet-Rebel-FURY-25B-GGUF
Text Generation
• 25B • Updated • 269
• 6
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF
Text Generation
• 8B • Updated • 41.5k
• 298
unsloth/Llama-3.1-8B-unsloth-bnb-4bit
Text Generation
• 8B • Updated • 11.4k
• 6
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation
• 50B • Updated • 32k
• 322
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation
• 8B • Updated • 57.2k
• • 222
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1-FP8
Text Generation
• 253B • Updated • 4.36k
• 12
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
Text Generation
• 5B • Updated • 25.8k
• 114
nvidia/Llama-3_3-Nemotron-Super-49B-v1-FP8
Text Generation
• 50B • Updated • 4.4k
• 13
Repoaner/llama_guard_vision
Image-Text-to-Text
• 11B • Updated • 5
• 1
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8
Text Generation
• 50B • Updated • 42k
• 27
mradermacher/Llama-3.1-8B-Instruct-heretic-i1-GGUF
8B • Updated • 5.16k
• 3
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4
Text Generation
• 26B • Updated • 13.8k
• 17
JohnsonPedia/llama-3-8b-yoruba-chat-gguf
Text Generation
• 8B • Updated • 33
• 1
srswti/Llama-3.2-11B-Vision-Instruct-abliterated
Image-to-Text
• 11B • Updated • 180
• 1
srswti/Llama-3.2-11B-Vision-Instruct-abliterated-4-bit
Image-to-Text
• 2B • Updated • 343
• 1
mradermacher/Golddiamondgold-Paperbliteration-L33-70b-GGUF
71B • Updated • 1.06k
• 6