From Perception to Action: An Interactive Benchmark for Vision Reasoning Paper • 2602.21015 • Published 1 day ago • 20
SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation Paper • 2602.16863 • Published 7 days ago • 14
ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation Paper • 2602.20093 • Published 2 days ago • 23
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published 13 days ago • 97
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents Paper • 2602.14234 • Published 10 days ago • 26
AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines Paper • 2602.14296 • Published 10 days ago • 47
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents Paper • 2602.16855 • Published 11 days ago • 43
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device Paper • 2602.20161 • Published 2 days ago • 21
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics Paper • 2602.19313 • Published 3 days ago • 22
RL's Razor: Why Online Reinforcement Learning Forgets Less Paper • 2509.04259 • Published Sep 4, 2025 • 7
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Paper • 2512.07783 • Published Dec 8, 2025 • 39
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 62
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 17 days ago • 202
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published 12 days ago • 43
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published 11 days ago • 50
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper • 2602.12675 • Published 13 days ago • 52
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs Paper • 2602.12705 • Published 13 days ago • 61