Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning Paper • 2507.06485 • Published Jul 9, 2025 • 4
Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding Paper • 2506.06275 • Published Jun 6, 2025
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation Paper • 2506.17113 • Published Jun 20, 2025 • 5
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? Paper • 2510.06036 • Published Oct 7, 2025 • 6
Planning with Sketch-Guided Verification for Physics-Aware Video Generation Paper • 2511.17450 • Published Nov 21, 2025 • 2
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning Paper • 2512.02425 • Published Dec 2, 2025 • 24
MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI Paper • 2512.09867 • Published 26 days ago
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 3 days ago • 39
Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding Paper • 2506.06275 • Published Jun 6, 2025
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation Paper • 2504.12140 • Published Apr 16, 2025
Aligning Neural Machine Translation Models: Human Feedback in Training and Inference Paper • 2311.09132 • Published Nov 15, 2023
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings Paper • 2411.05986 • Published Nov 8, 2024
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models Paper • 2506.07177 • Published Jun 8, 2025 • 23
Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization Paper • 2202.11453 • Published Feb 23, 2022
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization Paper • 2504.08641 • Published Apr 11, 2025 • 6
Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning Paper • 2506.03525 • Published Jun 4, 2025 • 6
EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance Paper • 2505.21876 • Published May 28, 2025 • 9
Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems Paper • 2504.09763 • Published Apr 14, 2025 • 12
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning Paper • 2503.05641 • Published Mar 7, 2025 • 2
RSQ: Learning from Important Tokens Leads to Better Quantized LLMs Paper • 2503.01820 • Published Mar 3, 2025 • 2