Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning Paper • 2506.08745 • Published Jun 10, 2025