haoyu wang's picture

1 8 1

haoyu wang

haoyuw

·

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

upvoted a paper 5 days ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

upvoted a paper 2 months ago

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

View all activity

Organizations

authored a paper 5 days ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Paper • 2602.01511 • Published 6 days ago • 13

authored a paper 4 months ago

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Paper • 2510.07743 • Published Oct 9, 2025 • 10