How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="enosislabs/aether-0.8b-cyber-gguf",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

EnosisLabs enosislabs-aether-0.8b-cyber GGUF

Generated UTC: 2026-06-19T04:33:38.559311+00:00 Source Transformers repo: enosislabs/aether-0.8b-cyber

GGUF deployment artifacts. Each quant lives in its own folder.

Quant Layout

  • q4_k_m
  • q5_k_m
  • q8_0

Quant Semantics

  • q2_k/: smallest, lowest memory, lowest fidelity.
  • q4_k_m/: balanced low-power default.
  • q5_k_m/: higher quality, more memory.
  • q8_0/: largest, highest fidelity among generated GGUF files.

llama.cpp

./llama-cli \
  -m q4_k_m/enosislabs-aether-0.8b-cyber.Q4_K_M.gguf \
  -p "<|im_start|>user\nAnalyze this finding.<|im_end|>\n<|im_start|>assistant\n"

Usage Rider

Intended only for authorized security testing, defensive engineering, internal red-team labs, and purple-team education.

Downloads last month
116
GGUF
Model size
0.8B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for enosislabs/aether-0.8b-cyber-gguf

Quantized
(1)
this model