Mashiro
AlexMashiro
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
22 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
upvoted
a
paper
30 days ago
Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended
Reasoning
upvoted
a
paper
about 1 month ago
Chasing the Tail: Effective Rubric-based Reward Modeling for Large
Language Model Post-Training
Organizations
None yet