arxiv:2506.08745
Kongcheng Zhang
sastpg
·
AI & ML interests
None yet
Recent Activity
authored
a paper
6 days ago
Reasoning with Reinforced Functional Token Tuning
authored
a paper
6 days ago
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning
for LLM Reasoning
updated
a dataset
7 days ago
sastpg/HIR-16K
Organizations
None yet