Hanan Tabak 's picture

Hanan Tabak

Hanan-Tabak

·

AI & ML interests

None yet

Organizations

commented a paper 11 months ago

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

Paper • 2507.08267 • Published Jul 11, 2025 • 11 •