
Gemma-4-E2B-it-abliterated (LiteRT-LM)

LiteRT-LM export of huihui-ai/Huihui-gemma-4-E2B-it-abliterated for on-device and edge inference workflows.

Model File

  • Gemma-4-E2B-it-abliterated.litertlm
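
One way to fetch just this file is sketched below. It assumes the Hugging Face Hub CLI is installed (`pip install -U huggingface_hub`; recent releases also ship it as `hf`) and that the file sits at the repository root under the name listed above. `download_model` is a hypothetical helper name used only for illustration.

```shell
# Assumption: repo id and file name as they appear on this model page.
REPO_ID="nqd145/Gemma-4-E2B-it-abliterated-litertlm"
MODEL_FILE="Gemma-4-E2B-it-abliterated.litertlm"

# Hypothetical helper: download only the bundle file, not the whole repo.
# $1 is the target directory (defaults to the current directory).
download_model() {
  huggingface-cli download "$REPO_ID" "$MODEL_FILE" --local-dir "${1:-.}"
}
```

Calling `download_model .` would place the bundle in the current directory, which matches the relative path used in the Quick Start command.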

Source

  • Base checkpoint: huihui-ai/Huihui-gemma-4-E2B-it-abliterated
  • Export pipeline: safetensors-to-litertlm

Export Notes

  • Export format: .litertlm (LiteRT-LM bundle)
  • Quantization: INT8 profile (dynamic_wi8_afp32: dynamic-range quantization with INT8 weights and FP32 activations)
  • Intended runtime: litert-lm CLI and LiteRT-LM-compatible apps

Quick Start (CPU)

litert-lm run ./Gemma-4-E2B-it-abliterated.litertlm --prompt "Hi" --backend cpu
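
The command above fails with a fairly opaque error if the bundle is missing or the CLI is not on `PATH`. A minimal sketch of a guarded wrapper follows; it uses only the `run`, `--prompt`, and `--backend cpu` arguments shown above, and the function names `preflight` and `run_quick_start` are hypothetical.

```shell
# Hypothetical helper: check prerequisites before invoking the CLI.
# Returns 1 if the model file is missing, 2 if litert-lm is not on PATH.
preflight() {
  if [ ! -f "$1" ]; then
    echo "model file not found: $1" >&2
    return 1
  fi
  if ! command -v litert-lm >/dev/null 2>&1; then
    echo "litert-lm CLI not found on PATH" >&2
    return 2
  fi
  return 0
}

# Hypothetical helper: run the Quick Start command only if preflight passes.
# $1 is the path to the .litertlm bundle, $2 is the prompt text.
run_quick_start() {
  preflight "$1" && litert-lm run "$1" --prompt "$2" --backend cpu
}
```

For example, `run_quick_start ./Gemma-4-E2B-it-abliterated.litertlm "Hi"` behaves like the Quick Start command once both checks pass.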

Limitations

  • Behavior may differ from the original HF checkpoint due to conversion/quantization/runtime differences.
  • Some export profiles that reduce memory pressure can alter section topology and runtime behavior.

Safety

This model may generate unsafe or incorrect content. Evaluate carefully for your use case and apply application-level safeguards where needed.

License

Please follow the upstream license and usage terms of:

  • huihui-ai/Huihui-gemma-4-E2B-it-abliterated
  • the underlying Gemma model family