How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "24B-Suite/Mergedonia-KARCHER-24B-v1"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "24B-Suite/Mergedonia-KARCHER-24B-v1",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker
docker model run hf.co/24B-Suite/Mergedonia-KARCHER-24B-v1
Quick Links
architecture: MistralForCausalLM
models:
  - model: B:\24B\!BeaverAI_Fallen-Mistral-Small-3.1-24B-v1e_textonly
  - model: B:\24B\!models--TheDrummer--Magidonia-24B-v4.3
  - model: B:\24B\!models--TheDrummer--Precog-24B-v1
  - model: B:\24B\!models--TheDrummer--Rivermind-24B-v1
  - model: B:\24B\!models--TheDrummer--Cydonia-24B-v4.3
merge_method: karcher
dtype: float32
out_dtype: bfloat16
parameters:
  tol: 1.0e-9
  max_iter: 1000
tokenizer_source: union
name: Mergedonia-Karcher-24B-v1
Downloads last month
89
Safetensors
Model size
24B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 24B-Suite/Mergedonia-KARCHER-24B-v1

Collection including 24B-Suite/Mergedonia-KARCHER-24B-v1