RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
Paper • 2605.10899 • Published • 70
Google ❤️ Open Source AI
TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking