A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning
Paper
• 2507.08267 • Published
• 11
Fast-Math is a model series designed to significantly improve inference efficiency while preserving accuracy on math reasoning tasks.