MatSpray: Fusing 2D Material World Knowledge on 3D Geometry Paper • 2512.18314 • Published 7 days ago • 7
PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence Paper • 2512.16793 • Published 8 days ago • 71
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published 8 days ago • 41
3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework Paper • 2512.17459 • Published 8 days ago • 11
GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation Paper • 2512.17495 • Published 8 days ago • 18
Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics Paper • 2512.15340 • Published 10 days ago • 1
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 10 days ago • 64
SS4D: Native 4D Generative Model via Structured Spacetime Latents Paper • 2512.14284 • Published 11 days ago • 13
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos Paper • 2512.10881 • Published 15 days ago • 29
ProPhy: Progressive Physical Alignment for Dynamic World Simulation Paper • 2512.05564 • Published 22 days ago • 5
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image Paper • 2512.05044 • Published 22 days ago • 16
3D-LLM: Injecting the 3D World into Large Language Models Paper • 2307.12981 • Published Jul 24, 2023 • 38
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published 23 days ago • 23
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Paper • 2512.03000 • Published 24 days ago • 36
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model Paper • 2512.01030 • Published 26 days ago • 19
MiMo-Embodied: X-Embodied Foundation Model Technical Report Paper • 2511.16518 • Published Nov 20 • 25
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning Paper • 2510.27606 • Published Oct 31 • 28
Error-Driven Scene Editing for 3D Grounding in Large Language Models Paper • 2511.14086 • Published Nov 18 • 6