shb777
/

Llama-3.3-8B-Instruct-128K

Text Generation

Model card Files Files and versions

Llama 3.3 8B 128K Instruct (Fixed)

Original allura-forge/Llama-3.3-8B-Instruct, Thanks!

imatrix GGUF's by mradermacher (Recommended)

static GGUF's

Evals

Additional Fixes:

Added rope_scaling
Added chat template (Unsloth) in tokenizer config
Updated generation config
Enabled full context length

Downloads last month: 718

Safetensors

Model size

8B params

Tensor type

BF16

·

Model tree for shb777/Llama-3.3-8B-Instruct-128K

Base model

allura-forge/Llama-3.3-8B-Instruct

Finetuned

(5)

this model

Adapters

1 model

Finetunes

1 model

Quantizations

Evaluation results

acc_norm on BBH
self-reported

54.100
acc_norm on GPQA
self-reported

29.900
acc on MMLU Pro
self-reported

38.000
acc_norm on MuSR
self-reported

37.800
avg(prompt_strict + inst_strict) on IFEval
self-reported

85.200
exact_match on MATH Hard
self-reported

27.300