Use with vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
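The same request can be sent from Python. The sketch below is a minimal illustration using only the standard library; it assumes the server started above is listening on the default `http://localhost:8000` and that the response follows the usual OpenAI-style chat completion shape. The helper names `build_chat_request` and `chat` are hypothetical and not part of vLLM.

```python
import json
import urllib.request

MODEL = "huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated"
BASE_URL = "http://localhost:8000/v1"  # assumed: same address as the curl example


def build_chat_request(prompt: str, model: str = MODEL) -> dict:
    """Build the same JSON body that the curl example sends."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def chat(prompt: str, base_url: str = BASE_URL, timeout: float = 60.0) -> str:
    """POST one user message and return the assistant's reply text."""
    body = json.dumps(build_chat_request(prompt)).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    # Assumed OpenAI-style response layout: choices[0].message.content
    return payload["choices"][0]["message"]["content"]


# Usage (requires the vLLM server to be running):
# print(chat("What is the capital of France?"))
```

Keeping the payload builder separate from the network call makes it possible to inspect or log the request body without a running server.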
Use Docker
docker model run hf.co/huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated

huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated

This is an uncensored version of cognitivecomputations/Dolphin3.0-Llama3.1-8B, created with abliteration (see remove-refusals-with-transformers to learn more about it).
It is a crude, proof-of-concept implementation that removes refusals from an LLM without using TransformerLens.
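This card does not spell out the procedure. As commonly described, abliteration estimates a "refusal direction" as the difference between mean hidden activations on prompts that trigger refusals and on prompts that do not, then projects that direction out of the model. The toy sketch below illustrates only that projection step on plain Python lists; the function names and the tiny example vectors are hypothetical, and this is not the code used to produce this model.

```python
def mean_vector(rows):
    """Column-wise mean of a list of equal-length vectors."""
    count = len(rows)
    return [sum(column) / count for column in zip(*rows)]


def refusal_direction(refusing_acts, complying_acts):
    """Unit vector pointing from the complying mean to the refusing mean."""
    diff = [
        a - b
        for a, b in zip(mean_vector(refusing_acts), mean_vector(complying_acts))
    ]
    norm = sum(x * x for x in diff) ** 0.5
    return [x / norm for x in diff]


def ablate(vector, direction):
    """Remove the component of `vector` that lies along unit `direction`."""
    dot = sum(v * d for v, d in zip(vector, direction))
    return [v - dot * d for v, d in zip(vector, direction)]


# Toy activations (hypothetical): refusals differ only in the first coordinate.
refusing = [[2.0, 0.0], [4.0, 0.0]]
complying = [[0.0, 0.0], [0.0, 0.0]]
direction = refusal_direction(refusing, complying)
cleaned = ablate([5.0, 7.0], direction)  # first coordinate is zeroed out
```

In a real model the same projection would be applied to high-dimensional activations or weight matrices, typically with a tensor library rather than Python lists.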

Use with ollama

You can use huihui_ai/dolphin3-abliterated directly:

ollama run huihui_ai/dolphin3-abliterated
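Once the model has been pulled, it can also be queried programmatically through Ollama's local REST API. The sketch below is a minimal illustration with the standard library; it assumes Ollama is serving on its default port `11434`, and the helper names `build_ollama_request` and `ollama_chat` are hypothetical.

```python
import json
import urllib.request

OLLAMA_MODEL = "huihui_ai/dolphin3-abliterated"
OLLAMA_URL = "http://localhost:11434/api/chat"  # assumed: Ollama's default address


def build_ollama_request(prompt: str, model: str = OLLAMA_MODEL) -> dict:
    """Build a non-streaming chat request body for Ollama."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one JSON object instead of a stream of chunks
    }


def ollama_chat(prompt: str, url: str = OLLAMA_URL, timeout: float = 120.0) -> str:
    """Send one user message to Ollama and return the reply text."""
    body = json.dumps(build_ollama_request(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["message"]["content"]


# Usage (requires a running Ollama instance with the model available):
# print(ollama_chat("What is the capital of France?"))
```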
Model size: 8B params (Safetensors, tensor type BF16)
