Zhepei Wei's picture

Zhepei Wei

weizhepei

·

https://weizhepei.com

AI & ML interests

None yet

Recent Activity

updated a model 6 days ago

relex-rlvr/RLVR-Qwen2.5-Math-1.5B

upvoted a paper 7 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

submitted a paper 7 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

View all activity

Organizations

commented a paper 8 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55 •

commented a paper about 1 year ago

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Paper • 2505.16421 • Published May 22, 2025 • 19 •