Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens • 16 days ago
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix • Nov 3, 2025