Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
demystify-long-cot
's Collections
Demysitifying Long CoT
Demysitifying Long CoT
updated
Mar 16, 2025
Curation of resources used in the paper "Demystifying Long Chain-of-Thought Reasoning in LLMs"
Upvote
4
Demystifying Long Chain-of-Thought Reasoning in LLMs
Paper
•
2502.03373
•
Published
Feb 5, 2025
•
58
demystify-long-cot/math-train-qwq-rs-n256
Viewer
•
Updated
Jan 21, 2025
•
1.14M
•
41
•
1
demystify-long-cot/llama-3.1-8b-math-qwq-n256-rft
8B
•
Updated
Jan 20, 2025
•
3
demystify-long-cot/math-train-qwq-rs-n192
Viewer
•
Updated
Jan 21, 2025
•
854k
•
9
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft-ppo
8B
•
Updated
Jan 20, 2025
•
3
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft
8B
•
Updated
Jan 20, 2025
•
4
demystify-long-cot/math-train-qwen-rs-n256
Viewer
•
Updated
Jan 23, 2025
•
1.53M
•
5
demystify-long-cot/llama-3.1-8b-math-qwen-n256-rft
8B
•
Updated
Jan 20, 2025
•
2
demystify-long-cot/math-train-action-n40
Viewer
•
Updated
Jan 23, 2025
•
217k
•
6
demystify-long-cot/math-train-rl
Viewer
•
Updated
Jan 20, 2025
•
7.5k
•
16
Upvote
4
Share collection
View history
Collection guide
Browse collections