SynthLabsAI/ALP_DeepScaleR_1.5B_C16K
Reinforcement Learning • 2B • Updated • 17 • 3
Scaling up good synthetic reasoning. Post-training and synthetic data research lab.