kureha295/deepseek-ai-DeepSeek-R1-Distill-Llama-8B-ortho-baseline-layer-11 8B • Updated Dec 31, 2025 • 1
Bochkov/growing-transformers-model-frozen-16-bit-baseline-monolyth-181m Text Generation • Updated 24 days ago • 23
Bochkov/growing-transformers-model-frozen-unicode-baseline-monolyth-247m Text Generation • Updated 24 days ago • 18
Bochkov/growing-transformers-model-unfrozen-baseline-monolyth-247m Text Generation • Updated 24 days ago • 15