Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video Paper • 2605.15182 • Published 12 days ago • 39
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 12 days ago • 81
Omni-Persona: Systematic Benchmarking and Improving Omnimodal Personalization Paper • 2605.09996 • Published 15 days ago • 8
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 23 days ago • 163
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published Apr 24 • 226
Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting Paper • 2604.12626 • Published Apr 14 • 15
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper • 2603.23516 • Published Mar 6 • 50
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 265
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published Jan 6 • 104
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29, 2025 • 46
SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression Paper • 2509.25176 • Published Sep 29, 2025 • 14
MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Paper • 2501.07556 • Published Jan 13, 2025 • 7
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation Paper • 2504.07955 • Published Apr 10, 2025 • 2