arxiv:2510.00492
Jiongdao Jin
jiongdao
AI & ML interests
None yet
Recent Activity
upvoted a paper 8 days ago
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients
updated a model 12 days ago
jiongdao/grpo-outputs
updated a dataset 12 days ago
jiongdao/grpo-results