Papers - Reinforcement Learning
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Paper
• 2310.20587
• 18
SELF: Language-Driven Self-Evolution for Large Language Model
Paper
• 2310.00533
• 2
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Paper
• 2305.19452
• 5
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper
• 2408.08152
• 61
Natural Language Reinforcement Learning
Paper
• 2411.14251
• 31
StarCraft II: A New Challenge for Reinforcement Learning
Paper
• 1708.04782
• 1
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper
• 2501.12948
• 441
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper
• 2402.03300
• 140