Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models Paper • 2504.02273 • Published Apr 3, 2025
ROOT: Robust Orthogonalized Optimizer for Neural Network Training Paper • 2511.20626 • Published Nov 25, 2025