MattBou00/SingleLR00001_2000samples-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated 23 days ago • 20
MattBou00/SequentialLR00001_2000samples-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated 23 days ago • 14
MattBou00/SequentialLR001_2000samples-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated 23 days ago • 16
MattBou00/SequentialLR001_2000samples-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated 23 days ago • 18
MattBou00/SequentialLR001_2000samples-checkpoint-epoch-60 • Reinforcement Learning • 1B • Updated 23 days ago • 21
MattBou00/SequentialLR001_2000samples_R1-checkpoint-epoch-20 • Reinforcement Learning • 1B • Updated 23 days ago • 15
MattBou00/SequentialLR001_2000samples_R1-checkpoint-epoch-40 • Reinforcement Learning • 1B • Updated 23 days ago • 17
mradermacher/MARSHAL-Generalist-Qwen3-4B-GGUF • Reinforcement Learning • 4B • Updated 13 days ago • 426
mradermacher/MARSHAL-Generalist-Qwen3-8B-GGUF • Reinforcement Learning • 8B • Updated 13 days ago • 424