SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published 6 days ago • 58
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models Paper • 2603.16859 • Published 13 days ago • 247
laion/CLIP-ViT-L-14-laion2B-s32B-b82K Zero-Shot Image Classification • 0.4B • Updated Jan 16, 2024 • 313k • 63