MoLora-v2
Collection
First Prototype of the second iteration of MoLora utilizing mixture of expert techniques applied to the Llama2 model. • 5 items • Updated • 2
How to use crumb/test-00-qlora-wizmlpmix-c2 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("/root/.cache/huggingface/hub/models--Trisert--open_llama_3b-sharded/snapshots/d9c0e88dfa5c6e35f856c4d1fefbfd75a9f2cd58")
model = PeftModel.from_pretrained(base_model, "crumb/test-00-qlora-wizmlpmix-c2")The following bitsandbytes quantization config was used during training: