Instructions to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with llama-cpp-python:
# !pip install llama-cpp-python
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA",
    filename="Anubis-Mini-11B-v1-F16.gguf",
)
llm.create_chat_completion(
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
                    },
                },
            ],
        }
    ]
)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with llama.cpp:
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
Use Docker
docker model run hf.co/EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA",
    "messages": [
      {
        "role": "user",
        "content": [
          { "type": "text", "text": "Describe this image in one sentence." },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
            }
          }
        ]
      }
    ]
  }'
Use Docker
docker model run hf.co/EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
- Ollama
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with Ollama:
ollama run hf.co/EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
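Once the model has been pulled with the command above, it can also be queried from a script. The following is a minimal sketch, not the author's method: it assumes Ollama is listening on its default local address (`http://localhost:11434`) and uses Ollama's `/api/chat` endpoint, which accepts base64-encoded images in the `images` field of a message. The helper names `build_chat_payload` and `describe_image`, and the file name `photo.jpg`, are hypothetical.

```python
import base64
import json
import urllib.request

# Model tag taken from the `ollama run` command above.
MODEL = "hf.co/EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M"
# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/chat"


def build_chat_payload(prompt: str, image_bytes: bytes, model: str = MODEL) -> dict:
    """Build a non-streaming /api/chat request body with one attached image."""
    return {
        "model": model,
        "stream": False,
        "messages": [
            {
                "role": "user",
                "content": prompt,
                # Ollama expects images as base64-encoded strings.
                "images": [base64.b64encode(image_bytes).decode("ascii")],
            }
        ],
    }


def describe_image(path: str, prompt: str = "Describe this image in one sentence.") -> str:
    """Send one local image to the model and return the reply text."""
    with open(path, "rb") as f:
        payload = build_chat_payload(prompt, f.read())
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["message"]["content"]

# Example call (requires a running Ollama server and a local image file):
# print(describe_image("photo.jpg"))
```

Only the standard library is used, so the script has no dependencies beyond a running Ollama instance of a compatible version.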
- Unsloth Studio
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA to start chatting
- Atomic Chat
- Docker Model Runner
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with Docker Model Runner:
docker model run hf.co/EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
- Lemonade
How to use EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA:Q4_K_M
Run and chat with the model
lemonade run user.Anubis-Mini-11B-v1-Vision-OLLAMA-Q4_K_M
List all available models
lemonade list
--------------------------------------------------
------- Anubis-Mini-11B-v1-Vision -------
--------------------------------------------------
Anubis-Mini-8B-v1 + Vision Tower
- Llama-3.2-11B-Vision (Vision Tower Source)
- Anubis-Mini-8B-v1 (Language Model Source)
!!!!!!!!!!!!$$$$$$$$$$$$$$$$$$$$$$$$$!!!!!!!!!!!!!!!!
###########################################
NOTICE: ---- Ollama v0.30.X ---- (And Newer)
###########################################
As of Ollama v0.30.X and newer, "mllama" type models (Llama3.2-Vision and others) are not compatible at this time!
What happened? Ollama completely replaced their backend for GGUF model runs.
To elaborate: they have done away with their Go runners for GGUF files completely. Instead, they now favor a direct llama.cpp client/server that they wrap to run GGUF models.
Because of this, Llama3.2-Vision based models are no longer supported on the versions mentioned above or newer. I do, however, have a pull request that I will be submitting this week that adds support for the "mllama" model type (Multimodal-Llama).
Thank you for understanding. I look forward to everyone being able to use this model, of which @TheDrummer made the original text-only version, because it's awesome!
This release contains:
Ollama-compatible model files, converted to GGUF and quantized
(Compatible ONLY with Ollama)
Compatibility:
- Compatible with Ollama v0.11.x through v0.20.2 and later releases before v0.30.X (see the notice above)
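The compatibility window can be checked in a script before pulling the model. This is a small sketch under stated assumptions: the lower bound (0.11) comes from the compatibility line above and the upper bound (0.30) from the notice. The function names `parse_version` and `is_supported` are hypothetical, and the input is assumed to be any string containing a dotted version number, such as the text printed by `ollama --version`.

```python
import re

MIN_SUPPORTED = (0, 11)  # from the card: "v0.11.x"
FIRST_BROKEN = (0, 30)   # from the notice: "v0.30.X (And Newer)"


def parse_version(text: str) -> tuple[int, int, int]:
    """Extract the first dotted version number (major.minor[.patch]) from text."""
    match = re.search(r"(\d+)\.(\d+)(?:\.(\d+))?", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    major, minor, patch = match.groups()
    return int(major), int(minor), int(patch or 0)


def is_supported(text: str) -> bool:
    """Return True if the version falls in the window stated on this card."""
    version = parse_version(text)
    # Only major and minor matter for the stated window.
    return MIN_SUPPORTED <= version[:2] < FIRST_BROKEN
```

For example, `is_supported("0.20.2")` is true, while `is_supported("0.30.0")` is false.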
Quantized GGUF version of:
- Anubis-Mini-11B-v1-Vision
(by EnlistedGhost)
Original Model Link:
-------------------------------------------------------------
- GGUF Conversion and Quantization Details: -
-------------------------------------------------------------
Software used to convert Safetensors to GGUF:
Software used to create Quantized GGUF Files:
Specific GitHub Commit Point:
Converted to GGUF and Quantized by:
--------------------------
---- Original Info ----
--------------------------
(Crossposted from the link in the above section: "Model Details"):
-----------------------------------
Anubis-Mini-11B-v1-Vision
-----------------------------------
Anubis-Mini-8B-v1 + Vision Tower
- Llama-3.2-11B-Vision (Vision Tower Source)
- Anubis-Mini-8B-v1 (Language Model Source)
Model Details
- Developed by: Meta, TheDrummer, EnlistedGhost
- Shared by: EnlistedGhost
- Model type: Llama-3.2-11B-Vision (Vision Tower Only) + Anubis-Mini-8B-v1
- Language(s) (NLP): English, Italian, German, Hindi, Thai, Spanish, French, Portuguese
- License: Llama3.3
Model Sources
- Repository #1: Anubis-Mini-8B-v1
- Repository #2: Llama-3.2-11B-Vision
Uses and/or Direct Use
- Same as Meta-Llama-3.3
Out-of-Scope Use
- Same as Meta-Llama-3.3
Bias, Risks, and Limitations
- Same as Meta-Llama-3.3
How to Get Started with the Model
- Same as Meta-Llama-3.3
Evaluation
- This model has NOT been evaluated! USE AT YOUR OWN RISK!
Testing Data
- This model has NOT been tested! USE AT YOUR OWN RISK!
Credit & Special Thanks
- Meta-Llama (Vision Tower Source)
- TheDrummer (Language Model Source)
- Huggingface (Tools that made this possible)
Technical Specifications
Software
- Huggingface-Safetensors
- Huggingface-CLI
- PyTorch
- Python-3.13
- Transformers
Model tree for EnlistedGhost/Anubis-Mini-11B-v1-Vision-OLLAMA
Base model
allura-forge/Llama-3.3-8B-Instruct