Zachary Bessinger

zbessinger

https://www.zachbessinger.com

AI & ML interests

Multimodal Computer Vision

Recent Activity

liked a model 18 days ago

Qwen/Qwen3-VL-30B-A3B-Instruct

liked a Space 28 days ago

WildVision/vision-arena

upvoted a paper 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

View all activity

Organizations

None yet

liked a model 18 days ago

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 815k • • 516

liked a Space 28 days ago

Vision Arena (Testing VLMs side-by-side)

🖼

559

Display image analysis results

liked 3 Spaces 3 months ago

Open VLM Leaderboard

🌎

974

VLMEvalKit Evaluation Results Collection

Transformers Timeline

🤗

Interactive timeline to explore the 🤗Transformers models

The Smol Training Playbook

📚

2.93k

The secrets to building world-class LLMs

liked a model 3 months ago

zai-org/GLM-4.6-FP8

Text Generation • 358B • Updated Oct 16, 2025 • 9.09k • • 97

liked 2 models 5 months ago

merve/smol-vision

Image-Text-to-Text • Updated Nov 5, 2025 • 191

kudzueye/boreal-qwen-image

Text-to-Image • Updated Sep 5, 2025 • 7.69k • • 124

liked a model 8 months ago

TIGER-Lab/VLM2Vec-Qwen2VL-7B

Image-to-Text • Updated May 3, 2025 • 3.8k • 10

liked a Space 8 months ago

MMEB Leaderboard

📊

The massive multimodal embedding benchmark

liked a model 8 months ago

DeepGlint-AI/UniME-LLaVA-OneVision-7B

Image-Text-to-Text • 8B • Updated May 7, 2025 • 19 • 3

liked 2 models 12 months ago

ByteDance/Sa2VA-8B

Image-Text-to-Text • 8B • Updated Sep 8, 2025 • 924 • 65

yayayaaa/florence-2-large-ft-moredetailed

Image-to-Text • 0.8B • Updated Dec 13, 2025 • 100 • 15

liked 3 models about 1 year ago

liked 4 Spaces over 1 year ago

Florence2 + SAM2

🔥

515

Segment and caption objects in images and videos

FLUX.1 [Inpainting]

🎨

642

FLUX.1 [Schnell]

🏎

5.04k

Generate unique images from text descriptions

Vgg Heads

🖼