ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection Paper • 2601.09195 • Published 26 days ago • 15
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests Paper • 2601.06953 • Published 28 days ago • 44
MgGladys/Qwen2_5vl_3B_multilayer_distill_AOP_10_pooling_12_26_a100_multinode_1_4 Image-to-Text • Updated Jan 4