Diversity-Incentivized Exploration for Versatile Reasoning Paper • 2509.26209 • Published Sep 30, 2025
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning Paper • 2505.19761 • Published May 26, 2025
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision Paper • 2504.15046 • Published Apr 21, 2025
Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning Paper • 2312.04819 • Published Dec 8, 2023
Mixture-of-Experts Meets In-Context Reinforcement Learning Paper • 2506.05426 • Published Jun 5, 2025