OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448 Video-Text-to-Text • 2B • Updated Mar 16, 2025 • 1.09k • 27
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 324k • 1.58k