AnIdealRing
SmartDazi
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
upvoted
a
paper
8 days ago
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents
upvoted
a
paper
20 days ago
InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning