How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ashercn97/manatee-7b"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ashercn97/manatee-7b",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
Use Docker
docker model run hf.co/ashercn97/manatee-7b
Quick Links

This is based on llama-2-7b and fine-tuned on two Orca datasets. It took around 6 hours on 1 L40 GPU.

Manatee is one of my first projects, so I hope you enjoy using it! To use it, you can either use it through the transformer library or if you have limited memory, you can use the GPTQ version that is on my profile!

In the future, I plan on fine-tuning higher parameter models or making a better version of Manatee-7b.

Downloads last month
930
Inference Providers NEW

Dataset used to train ashercn97/manatee-7b

Spaces using ashercn97/manatee-7b 31