Rhodan-2B

A Qwen3.5-2B model fine-tuned on the Perry Rhodan sci-fi novel series dataset. This model is specialized for the Perry Rhodan universe.

Model Details

VRAM Recommendations

Quantization Download File Size Recommended VRAM Notes
F16 Download 3.8 GB 6GB - 8GB Highest quality, no information loss.
Q8_0 Download 2.0 GB 4GB Near-lossless quality. Recommended for most tasks.
Q5_K_M Download 1.4 GB 4GB Excellent balance between size and quality.
Q4_K_M Download 1.3 GB 2GB - 4GB Standard quantization. Good for general use.
Q3_K_M Download 1.1 GB 2GB Smallest size, noticeable quality trade-offs.
mmproj Download 641 MB - Required for multimodal/vision features.

Usage

Ollama

  1. Download your preferred quantization.
  2. Create a Modelfile:
    FROM ./rhodan-2b.Q4_K_M.gguf
    SYSTEM You are a Perry Rhodan lore expert.
    
  3. Run ollama create rhodan -f Modelfile

LM Studio / llama.cpp

Compatible with any GGUF-supporting engine. For vision features, ensure you load the mmproj file alongside the model.

Training Info

Trained using Unsloth for efficient 4-bit fine-tuning.

Downloads last month
18
GGUF
Model size
2B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

3-bit

4-bit

5-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support