Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,402

Full-text search

Active filters: multimodal

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 8 days ago • 1.57k • 74

ByteDance/Dolphin-v2

Image-Text-to-Text • 4B • Updated 1 day ago • 105 • 49

Cognitive-Lab/NetraEmbed

Visual Document Retrieval • 4B • Updated 3 days ago • 565 • 20

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 2 days ago • 66.9k • 438

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 285k • 759

OctoMed/OctoMed-7B

Image-Text-to-Text • 8B • Updated 7 days ago • 991 • 16

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 3.13M • • 1.39k

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 331k • 452

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 6.78M • 575

IDEA-Research/Rex-Omni

Image-Text-to-Text • 4B • Updated Oct 16 • 23.5k • 50

stepfun-ai/GELab-Zero-4B-preview

Image-to-Text • 4B • Updated 12 days ago • 991 • 93

jinaai/jina-clip-v2

Feature Extraction • 0.9B • Updated Apr 28 • 154k • 297

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 1.83M • 475

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 149k • 1.83k

cpatonn/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit

Any-to-Any • 10B • Updated Sep 28 • 26.7k • 35

omlab/VLM-FO1_Qwen2.5-VL-3B-v01

Object Detection • 4B • Updated 16 days ago • 2.15k • 13

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 111k • • 571

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22 • 24.6k • 180

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22 • 55.7k • 233

ByteDance/Dolphin-1.5

Image-Text-to-Text • 0.4B • Updated Nov 12 • 1.52k • 33

lijiayangCS/DiTFuse

Image-to-Image • Updated 11 days ago • 4

thesby/Qwen3-VL-8B-NSFW-Caption-V4.5

Image-to-Text • 9B • Updated Nov 7 • 19.3k • 51

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.44M • • 1.25k

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 28.2k • 120

OpenGVLab/InternVideo2_5_Chat_8B

Video-Text-to-Text • 8B • Updated Aug 4 • 44.6k • 87

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • 8B • Updated Apr 6 • 191k • 95

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16 • 4.33k • 508

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 28 • 10.2k • 48

OpenGVLab/VideoChat-R1_5-7B

Video-Text-to-Text • 8B • Updated Oct 2 • 11.3k • 10

Kwai-Keye/Keye-VL-671B-A37B

Video-Text-to-Text • 672B • Updated 23 days ago • 146 • 18