-
-
-
-
-
-
Inference Providers
Active filters: open-r1
rkumar1999/Phi-mini-MoE-Prover-openr1-distill-SFT
Text Generation
• 2.99M • Updated
• 13
• 1
jayzou3773/Qwen1.5-MOE-sft-gsm8k
Text Generation
• 14B • Updated
• 47
• 1
edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 2
yucaiwen/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO
Text Generation
• 8B • Updated
• 11
• 1
JinnP/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
bangan/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
liusq19/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
stepyoun/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 3
howey/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 4
wxnfifth/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 3
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math
Text Generation
• 8B • Updated
• 7
Text Generation
• 8B • Updated
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math
Text Generation
• 2B • Updated
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math
Text Generation
• 2B • Updated
• 2
skzxjus/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 1
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF
8B • Updated
• 126
skzxjus/Qwen2.5-7B-1m-Open-R1-Distill
Text Generation
• 8B • Updated
• 8
• 4
skzxjus/Qwen2.5-7B-Open-R1-GRPO
Text Generation
• 8B • Updated
• 1
ununtrium/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
• 2B • Updated
• 1
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-i1-GGUF
8B • Updated
• 162
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
• 8B • Updated
• 5
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
2B • Updated
• 40
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
8B • Updated
• 589
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
• 8B • Updated
• 2
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
• 8B • Updated
• 2
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
• 2B • Updated
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
• 2B • Updated
• 2
yh-yao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
• 2B • Updated
• 1
Dongwei/Qwen-2.5-7B_Base_Math_smalllr
Text Generation
• 8B • Updated
• 4
• 6