RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 2 days ago • 23
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 24 days ago • 110
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 19 items • Updated 5 days ago • 61
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 25 days ago • 42
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published Dec 8, 2025 • 58
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 12 days ago • 93
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper • 2512.16969 • Published 23 days ago • 111
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published 23 days ago • 203
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 226
artificial-hivemind Collection This collection contains datasets for the Artificial Hiveminds paper. • 4 items • Updated May 16, 2025 • 12
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Paper • 2512.00956 • Published Nov 30, 2025 • 20
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published Nov 26, 2025 • 27