Instructions to use utter-project/EuroLLM-9B-Instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use utter-project/EuroLLM-9B-Instruct with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="utter-project/EuroLLM-9B-Instruct")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("utter-project/EuroLLM-9B-Instruct")
model = AutoModelForCausalLM.from_pretrained("utter-project/EuroLLM-9B-Instruct")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
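Since this is a roughly 9B-parameter model, loading it in full float32 precision can exhaust GPU memory. A minimal sketch of loading it in bfloat16 with automatic device placement (an illustrative example, not from the model card; device_map="auto" requires the accelerate package):

# Load the model in half precision and let Accelerate place the weights
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "utter-project/EuroLLM-9B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # halves memory relative to float32
    device_map="auto",           # requires the accelerate package
)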
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use utter-project/EuroLLM-9B-Instruct with vLLM:
Install from pip and serve the model
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "utter-project/EuroLLM-9B-Instruct"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "utter-project/EuroLLM-9B-Instruct",
    "messages": [
      { "role": "user", "content": "What is the capital of France?" }
    ]
  }'
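Because the vLLM server exposes an OpenAI-compatible API, it can also be called from Python instead of curl. A minimal sketch using the openai client package (assumes pip install openai; the base URL and port match the default vllm serve settings above, and the API key is a placeholder because the local server does not check it):

# Query the local vLLM server through its OpenAI-compatible endpoint
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # default vLLM port used above
    api_key="not-needed",                 # placeholder; the local server ignores it
)
response = client.chat.completions.create(
    model="utter-project/EuroLLM-9B-Instruct",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)
print(response.choices[0].message.content)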
Use Docker
docker model run hf.co/utter-project/EuroLLM-9B-Instruct
- SGLang
How to use utter-project/EuroLLM-9B-Instruct with SGLang:
Install from pip and serve the model
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
  --model-path "utter-project/EuroLLM-9B-Instruct" \
  --host 0.0.0.0 \
  --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "utter-project/EuroLLM-9B-Instruct",
    "messages": [
      { "role": "user", "content": "What is the capital of France?" }
    ]
  }'
Use Docker images
docker run --gpus all \
  --shm-size 32g \
  -p 30000:30000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HF_TOKEN=<secret>" \
  --ipc=host \
  lmsysorg/sglang:latest \
  python3 -m sglang.launch_server \
    --model-path "utter-project/EuroLLM-9B-Instruct" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "utter-project/EuroLLM-9B-Instruct",
    "messages": [
      { "role": "user", "content": "What is the capital of France?" }
    ]
  }'
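Since the SGLang server also speaks the OpenAI-compatible API, the Python client sketch from the vLLM section above works here as well; just point base_url at http://localhost:30000/v1.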
- Docker Model Runner
How to use utter-project/EuroLLM-9B-Instruct with Docker Model Runner:
docker model run hf.co/utter-project/EuroLLM-9B-Instruct
model_type not defined
#9
by Smilits - opened
Hi, when I try to run the model I get an error saying that model_type is not defined and that it should be one of a certain list. I am using the code provided in the model card:
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "utter-project/EuroLLM-9B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
messages = [
    {
        "role": "system",
        "content": "You are EuroLLM --- an AI assistant specialized in European languages that provides safe, educational and helpful answers.",
    },
    {
        "role": "user",
        "content": "What is the capital of Portugal? How would you describe it?",
    },
]
inputs = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors="pt")
outputs = model.generate(inputs, max_new_tokens=1024)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Therefore, I downloaded the model locally, and now I am able to run it. Here is my setup:
from huggingface_hub import snapshot_download
from transformers import LlamaTokenizer, LlamaForCausalLM
import torch
DOWNLOAD_MODEL_LOCALLY = False
if DOWNLOAD_MODEL_LOCALLY:
    local_path = snapshot_download(
        repo_id="utter-project/EuroLLM-9B-Instruct",
        local_dir="./EuroLLM-9B-Instruct",
        local_dir_use_symlinks=False,  # ensure full copy
    )
model_path = "./EuroLLM-9B-Instruct"
tokenizer = LlamaTokenizer.from_pretrained(model_path, use_fast=False)
tokenizer.pad_token_id = tokenizer.eos_token_id
model = LlamaForCausalLM.from_pretrained(
    model_path,
    trust_remote_code=True,
    device_map="auto",
    torch_dtype=torch.bfloat16,
)
messages = [
    {"role": "system", "content": "You are EuroLLM --- an AI assistant specialized in European languages that provides safe, educational and helpful answers."},
    {"role": "user", "content": "What is the capital of the Netherlands? Tell me something about it."}
]
# Generate chat-formatted input instead of prompt and inputs - v0, kind of working
inputs = tokenizer.apply_chat_template(
    messages,
    tokenize=True,
    add_generation_prompt=True,
    return_tensors="pt"
).to(model.device)
# # Safe pad fallback
# if tokenizer.pad_token_id is None:
# tokenizer.pad_token_id = tokenizer.eos_token_id
# Generate
outputs = model.generate(
    input_ids=inputs,
    max_new_tokens=512,
    do_sample=False,
    pad_token_id=2,
    eos_token_id=4
)
# Decode
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
However, I am getting output such as:
<|im_start|> system
You are EuroLLM --- an AI assistant specialized in European languages that provides safe, educational and helpful answers.
<|im_start|> user
What is the capital of the Netherlands? Tell me something about it.
<|im_start|> assistant
ونssss
Is it something I am doing wrong, or is the model itself just that bad? I assume the former. Could someone help me run the model correctly?