arxiv:2603.05369
Sky
dandingsky
AI & ML interests
None yet
Recent Activity
commentedon a paper 19 days ago
Progressive Residual Warmup for Language Model Pretraining submitted a paper 20 days ago
Progressive Residual Warmup for Language Model Pretraining authored a paper 22 days ago
Thinking-Free Policy Initialization Makes Distilled Reasoning Models
More Effective and Efficient Reasoners