Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text β’ 31B β’ Updated Nov 26, 2025 β’ 815k β’ β’ 516
Running Featured 559 Vision Arena (Testing VLMs side-by-side) πΌ 559 Display image analysis results
Running on CPU Upgrade Featured 2.93k The Smol Training Playbook π 2.93k The secrets to building world-class LLMs
yayayaaa/florence-2-large-ft-moredetailed Image-to-Text β’ 0.8B β’ Updated Dec 13, 2025 β’ 100 β’ 15
meta-llama/Llama-3.2-11B-Vision Image-Text-to-Text β’ 11B β’ Updated Sep 27, 2024 β’ 10.4k β’ 579
Runtime error Featured 515 Florence2 + SAM2 π₯ 515 Segment and caption objects in images and videos
Running on Zero Featured 5.04k FLUX.1 [Schnell] π 5.04k Generate unique images from text descriptions