Article Distilling from Dialogues: Finding Meaning in LLM Interactions chansung • Feb 25, 2025 • 5
Article Small Language Models (SLM): A Comprehensive Overview jjokah • Feb 22, 2025 • 150
Article 🐯 Liger GRPO meets TRL shisahni, kashif, smohammadi, ShirinYamani, m0m0chen, liberty4321 • May 25, 2025 • 53
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb • May 21, 2025 • 258
Article PaliGemma – Google's Cutting-Edge Open Vision Language Model merve, andsteing, pcuenq • May 14, 2024 • 287
ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models Paper • 2503.00564 • Published Mar 1, 2025 • 2
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent qgallouedec, edbeeching, ClementRomac, thomwolf • Apr 22, 2024 • 81
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes ybelkada, timdettmers • Aug 17, 2022 • 131
Article Chat Templates: An End to the Silent Performance Killer Rocketknight1 • Oct 3, 2023 • 32