AI & ML interests
NLP
wzq016/qwen25-entrie-guideline-8k • 8B • 3
wzq016/qwen25-rlrm-filtered-guideline • 8B • 5
wzq016/qwen25-rlrm-entire-guideline • 8B • 4
wzq016/llama3-skywork-rlrm-code-math-grpo-kl • 8B • 4
wzq016/qwen25-skywork-rlrm-code-math-grpo-kl • 8B • 3
wzq016/llama3-skywork-rlrm-new-filtered-grpo-kl • 8B • 2
wzq016/llama3-skywork-rlrm-new-filtered-code-grpo-kl • 8B • 2
wzq016/llama3-skywork-rlrm-filtered-code-grpo-kl • 8B • 2
wzq016/llama3-skywork-rlrm-filtered-grpo-kl • 8B • 3
wzq016/llama3-skywork-sft-rlrm • 8B • 4
wzq016/llama3-skywork-rlrm • 8B • 2