How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "cym2784841400/Mathstral-7b-v0.1-Custom-Handler"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "cym2784841400/Mathstral-7b-v0.1-Custom-Handler",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker
docker model run hf.co/cym2784841400/Mathstral-7b-v0.1-Custom-Handler
Quick Links

This model is based on mistralai/Mathstral-7B-v0.1 (commit b6408c3) and has not made changes to the model itself.

We added a special Custom Handler (handler.py and other necessary files, dependencies, etc) to the model.

For more information, please refer to the original model card.

Downloads last month
11
Safetensors
Model size
7B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for cym2784841400/Mathstral-7b-v0.1-Custom-Handler

Finetuned
(17)
this model