AI-Insight's Collections
💡HF Papers Live 4: Multi Modal models
Image-Text-to-Text • 241B • Updated • 46.7k • 250
Intern-S1: A Scientific Multimodal Foundation Model
Paper • 2508.15763 • Published • 259
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Paper • 2408.01800 • Published • 88
Image-Text-to-Text • 9B • Updated • 40.9k • 1.04k
Image-Text-to-Text • 4B • Updated • 56k • 460
Image-Text-to-Text • 108B • Updated • 31.8k • 701
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text • 10B • Updated • 117k • 762
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper • 2507.01006 • Published • 250
Paper • 2508.11737 • Published • 111
Image-Text-to-Text • 3B • Updated • 51.8k • 199
Image-Text-to-Text • 321B • Updated • 63.8k • 164