SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences? Paper • 2604.10718 • Published 4 days ago • 3
The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning Paper • 2604.06427 • Published 9 days ago • 11
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 3 days ago • 24
The ATOM Report: Measuring the Open Language Model Ecosystem Paper • 2604.07190 • Published 8 days ago • 5
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models Paper • 2603.25744 • Published 21 days ago • 13
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 24 days ago • 46
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published 30 days ago • 109
When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning Paper • 2603.21289 • Published 25 days ago • 35
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought Paper • 2603.22847 • Published 23 days ago • 26
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models Paper • 2603.24844 • Published 22 days ago • 10
Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs Paper • 2603.22446 • Published 24 days ago • 10
mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 24 days ago • 39
Scalable Prompt Routing via Fine-Grained Latent Task Discovery Paper • 2603.19415 • Published 28 days ago • 7
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning Paper • 2603.23483 • Published 23 days ago • 62
Understanding the Challenges in Iterative Generative Optimization with LLMs Paper • 2603.23994 • Published 22 days ago • 28
Learning to Commit: Generating Organic Pull Requests via Online Repository Memory Paper • 2603.26664 • Published 20 days ago • 9
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 21 days ago • 131