P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published Nov 17, 2025 • 134
weizechen/RL-Compositionality-Stage2-RL-Level8-TestData Viewer • Updated Oct 17, 2025 • 2.05k • 25 • 1
weizechen/RL-Compositionality-Stage2-RL-Level2-TrainData Viewer • Updated Oct 17, 2025 • 500k • 15 • 1
weizechen/RL-Compositionality-Stage2-RL-Level1-TrainData Viewer • Updated Oct 17, 2025 • 500k • 21 • 1
RL Compositionality Collection From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated Oct 17, 2025 • 1
RL Compositionality Collection From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones. https://huggingface.co/papers/2509.25123 • 5 items • Updated Oct 17, 2025 • 1
weizechen/RL-Compositionality-Stage2-RL-Level8-TestData Viewer • Updated Oct 17, 2025 • 2.05k • 25 • 1
weizechen/RL-Compositionality-Stage2-RL-Level2-TrainData Viewer • Updated Oct 17, 2025 • 500k • 15 • 1
weizechen/RL-Compositionality-Stage2-RL-Level1-TrainData Viewer • Updated Oct 17, 2025 • 500k • 21 • 1
From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones Paper • 2509.25123 • Published Sep 29, 2025 • 20
From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones Paper • 2509.25123 • Published Sep 29, 2025 • 20 • 2
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe Paper • 2509.18154 • Published Sep 16, 2025 • 52