How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "DevQuasar/HermesNova-Llama-3.1-8B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "DevQuasar/HermesNova-Llama-3.1-8B",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker
docker model run hf.co/DevQuasar/HermesNova-Llama-3.1-8B
Quick Links

'Make knowledge free for everyone'

Buy Me a Coffee at ko-fi.com

HermesNova

image/jpeg

The 2 most powerful LLama3.1 model Hermes-3-Llama-3.1-8B and Llama-3.1-SuperNova-Lite merged

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the linear merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: NousResearch/Hermes-3-Llama-3.1-8B
    parameters:
      weight: 1.0
  - model: arcee-ai/Llama-3.1-SuperNova-Lite
    parameters:
      weight: 1.0
merge_method: linear
dtype: float16
Downloads last month
6
Safetensors
Model size
8B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DevQuasar/HermesNova-Llama-3.1-8B

Spaces using DevQuasar/HermesNova-Llama-3.1-8B 2

Collection including DevQuasar/HermesNova-Llama-3.1-8B

Paper for DevQuasar/HermesNova-Llama-3.1-8B

Evaluation results