🧠 SmolLM3 Collection Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9, 2025 • 92
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 77
GraphGPT: Generative Pre-trained Graph Eulerian Transformer Paper • 2401.00529 • Published Dec 31, 2023 • 1
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated 8 days ago • 350