Active filters: Sa2VA
Image-Text-to-Text • 4B • Updated • 5

Dense-World/Sa2VA_InternVL2.5_4b
Image-Text-to-Text • 4B • Updated • 10 • 1

Dense-World/Sa2VA_InternVL2.5_8b
Image-Text-to-Text • 8B • Updated • 10

Dense-World/Sa2VA_InternVL2.5_26b
Image-Text-to-Text • 26B • Updated • 12

Image-Text-to-Text • 4B • Updated • 1.52k • 99

Image-Text-to-Text • 8B • Updated • 5.03k • 66

Image-Text-to-Text • 1B • Updated • 655 • 30

Image-Text-to-Text • 26B • Updated • 136 • 32

Image Segmentation • 4B • Updated • 22

Image Segmentation • 1B • Updated • 12

Image Segmentation • 8B • Updated • 229

Image Segmentation • 26B • Updated • 12

ByteDance/Sa2VA-InternVL3-2B
Image-Text-to-Text • 2B • Updated • 303 • 2

ByteDance/Sa2VA-InternVL3-8B
Image-Text-to-Text • 8B • Updated • 106 • 5

ByteDance/Sa2VA-InternVL3-14B
Image-Text-to-Text • 15B • Updated • 142 • 10

ByteDance/Sa2VA-Qwen2_5-VL-3B
Image-Text-to-Text • 4B • Updated • 82 • 3

ByteDance/Sa2VA-Qwen2_5-VL-7B
Image-Text-to-Text • 9B • Updated • 185 • 5

ByteDance/Sa2VA-Qwen3-VL-4B
Image-Text-to-Text • 5B • Updated • 1.19k • 16

ByteDance/Sa2VA-Qwen3-VL-2B
Image-Text-to-Text • 3B • Updated • 114 • 17