Instructions to use 0xsoftboi/gemma-4-e2b-it-kali-nethunter-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use 0xsoftboi/gemma-4-e2b-it-kali-nethunter-lora with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # if on a CUDA device, also pip install mlx[cuda] # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("0xsoftboi/gemma-4-e2b-it-kali-nethunter-lora") prompt = "Once upon a time in" text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- MLX LM
How to use 0xsoftboi/gemma-4-e2b-it-kali-nethunter-lora with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Generate some text mlx_lm.generate --model "0xsoftboi/gemma-4-e2b-it-kali-nethunter-lora" --prompt "Once upon a time"
| # Gemma 4 E2B LoRA config — optimized for 8GB RAM | |
| lora_parameters: | |
| rank: 8 | |
| alpha: 16 | |
| dropout: 0.05 | |
| scale: 10.0 | |
| batch_size: 1 | |
| iters: 200 | |
| learning_rate: 1e-5 | |
| save_every: 50 | |
| num_layers: 4 | |
| grad_checkpoint: true | |
| mask_prompt: true | |
| max_seq_length: 1024 | |