aleegis committed on Jun 25, 2025

Commit 482f343 (verified) · 1 parent: 1ff19f9

Training in progress, step 8, checkpoint

Files changed (12)

last-checkpoint/adapter_model.safetensors +1 -1
last-checkpoint/optimizer.pt +1 -1
last-checkpoint/rng_state_0.pth +1 -1
last-checkpoint/rng_state_1.pth +1 -1
last-checkpoint/rng_state_2.pth +1 -1
last-checkpoint/rng_state_3.pth +1 -1
last-checkpoint/rng_state_4.pth +1 -1
last-checkpoint/rng_state_5.pth +1 -1
last-checkpoint/rng_state_6.pth +1 -1
last-checkpoint/rng_state_7.pth +1 -1
last-checkpoint/scheduler.pt +1 -1
last-checkpoint/trainer_state.json +103 -3

last-checkpoint/adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d0f423296eed412a43a6481cc745dcdec555ea53d6a28376d4645a1ddd079536
+oid sha256:5cc1bf0775452c66e630907c0e30a2edcc8c2a5d2c9f91001642fe44b130a208
 size 114106856
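
Each binary file in this commit is stored as a Git LFS pointer: a `version` line, an `oid sha256:` line, and a `size` line, so only the `oid` changes when the checkpoint is overwritten. The following is a minimal sketch, assuming the real object has been downloaded locally, of parsing such a pointer and checking a file against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the space-separated key/value lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file's byte size and SHA-256 digest match the pointer."""
    data = path.read_bytes()  # reads the whole file; fine for a sketch
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["oid"]
    )


# New pointer for last-checkpoint/adapter_model.safetensors, copied from the diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:5cc1bf0775452c66e630907c0e30a2edcc8c2a5d2c9f91001642fe44b130a208\n"
    "size 114106856\n"
)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer(Path("last-checkpoint/adapter_model.safetensors"), pointer)` on a local copy would then confirm that the download corresponds to this commit rather than to the parent.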

last-checkpoint/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:66d20f7a58ec447cb248229527094c1e040e4abf7a30a7ba96e36ced2acf6a1f
+oid sha256:7e2cc1980a0a5851ee298663f56069e8c2ba79eb97a2420ff5b5e271fb0e040a
 size 228544802

last-checkpoint/rng_state_0.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a4521c23099d58f17393188663381aebb51989cdce90e8f78a53101aa7f6762a
+oid sha256:f58a3bdcb3b3e0e2a18613287482ebe97a8b7d43a03e373aecb6a3083d36f67a
 size 15984

last-checkpoint/rng_state_1.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3d02c348e1816565b2b6442fb1f80408b839c89e2b875b0885f460fddc30d428
+oid sha256:878a311f455e5117765ebeb146d16abff0a6a13e00edc4f985ce3286b8b98242
 size 15920

last-checkpoint/rng_state_2.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10b39575c844adc9b7f3b21dc61653deec5e4385e356ed923c053dd3a0af28ef
+oid sha256:655f7670674eff250d65759fdd0dd7eb51e5705f842e7551d1b905e68127ee52
 size 15920

last-checkpoint/rng_state_3.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f471cb69d8974507a8527c48ece168be306785671e9d2af2ae0093ad41d8c082
+oid sha256:0f82d2f4765d9425a2cb42fb7f6a8ae64af504a87ac91ca4331ebf249fd3839c
 size 15984

last-checkpoint/rng_state_4.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:deeae4cbb5fbf846924eaccf1a5361e183e579bf055c633f1be19df2176453f3
+oid sha256:69e3dc77712dffcf9da4464c25cdb4bb065d19be4a2fe275a6094e5e4b14cf84
 size 15984

last-checkpoint/rng_state_5.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fe01295183e1d4fc411b2626b446c42dc2ab5b7d50aa5625d94b209333edbc96
+oid sha256:5ab104e8f1584a8b309e132c94b4e0a79cd51a31f9242b65a990050d441b09f1
 size 15984

last-checkpoint/rng_state_6.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0201222966c8920f6720ff14467f45bab83af1fa44b5ac0c24f2dc10a07b7078
+oid sha256:abaebd55038ac3025a94ee8437398077c9b2ba8c28ec24464510da614d10bbf3
 size 15984

last-checkpoint/rng_state_7.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:005a56237a167c99429d1870b7a8b9b818fd0bf3ab564f87a3c5bbdd83dfaa82
+oid sha256:7542770e76e460f3b4410b628150a5d74d2365cb698455d841f6190d3c07e8f1
 size 15920

last-checkpoint/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:abf9c45e6f0130e9e72ce7a69d6f7581163c5f801eab4ae35fb91d5270488ffe
+oid sha256:186ecb89c57722d40d2724a31a3f7415875659b15c27a352155664ea67992ca3
 size 1064

last-checkpoint/trainer_state.json CHANGED

@@ -2,9 +2,9 @@
   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
-  "epoch": 0.01444043321299639,
+  "epoch": 0.02888086642599278,
   "eval_steps": 500,
-  "global_step": 4,
+  "global_step": 8,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
@@ -108,11 +108,111 @@
       "rewards/reward_short_completions/mean": -1054.09375,
       "rewards/reward_short_completions/std": 286.53179931640625,
       "step": 4
+    },
+    {
+      "clip_ratio/high_max": 0.0,
+      "clip_ratio/high_mean": 0.0,
+      "clip_ratio/low_mean": 0.0,
+      "clip_ratio/low_min": 0.0,
+      "clip_ratio/region_mean": 0.0,
+      "completions/clipped_ratio": 1.0,
+      "completions/max_length": 256.0,
+      "completions/max_terminated_length": 0.0,
+      "completions/mean_length": 256.0,
+      "completions/mean_terminated_length": 0.0,
+      "completions/min_length": 256.0,
+      "completions/min_terminated_length": 0.0,
+      "epoch": 0.018050541516245487,
+      "grad_norm": 0.17358577251434326,
+      "kl": 0.0008631395467091352,
+      "learning_rate": 9.698463103929542e-05,
+      "loss": 0.0,
+      "num_tokens": 43368.0,
+      "reward": -3253.4951171875,
+      "reward_std": 581.868408203125,
+      "rewards/reward_short_completions/mean": -1086.59375,
+      "rewards/reward_short_completions/std": 254.26222229003906,
+      "step": 5
+    },
+    {
+      "clip_ratio/high_max": 0.0,
+      "clip_ratio/high_mean": 0.0,
+      "clip_ratio/low_mean": 0.0,
+      "clip_ratio/low_min": 0.0,
+      "clip_ratio/region_mean": 0.0,
+      "completions/clipped_ratio": 0.8125,
+      "completions/max_length": 256.0,
+      "completions/max_terminated_length": 252.0,
+      "completions/mean_length": 234.09375,
+      "completions/mean_terminated_length": 139.1666717529297,
+      "completions/min_length": 2.0,
+      "completions/min_terminated_length": 2.0,
+      "epoch": 0.021660649819494584,
+      "grad_norm": 0.17494061589241028,
+      "kl": 0.0012313149636611342,
+      "learning_rate": 9.330127018922194e-05,
+      "loss": 0.1467,
+      "num_tokens": 51739.0,
+      "reward": -3192.11376953125,
+      "reward_std": 729.4127197265625,
+      "rewards/reward_short_completions/mean": -1066.09375,
+      "rewards/reward_short_completions/std": 342.6435546875,
+      "step": 6
+    },
+    {
+      "clip_ratio/high_max": 0.0,
+      "clip_ratio/high_mean": 0.0,
+      "clip_ratio/low_mean": 0.0,
+      "clip_ratio/low_min": 0.0,
+      "clip_ratio/region_mean": 0.0,
+      "completions/clipped_ratio": 0.84375,
+      "completions/max_length": 256.0,
+      "completions/max_terminated_length": 198.0,
+      "completions/mean_length": 230.5,
+      "completions/mean_terminated_length": 92.80000305175781,
+      "completions/min_length": 26.0,
+      "completions/min_terminated_length": 26.0,
+      "epoch": 0.02527075812274368,
+      "grad_norm": 0.19088327884674072,
+      "kl": 0.0013786845956929028,
+      "learning_rate": 8.83022221559489e-05,
+      "loss": 0.1449,
+      "num_tokens": 60243.0,
+      "reward": -2963.8046875,
+      "reward_std": 874.9676513671875,
+      "rewards/reward_short_completions/mean": -989.84375,
+      "rewards/reward_short_completions/std": 346.3221435546875,
+      "step": 7
+    },
+    {
+      "clip_ratio/high_max": 0.0,
+      "clip_ratio/high_mean": 0.0,
+      "clip_ratio/low_mean": 0.0,
+      "clip_ratio/low_min": 0.0,
+      "clip_ratio/region_mean": 0.0,
+      "completions/clipped_ratio": 0.84375,
+      "completions/max_length": 256.0,
+      "completions/max_terminated_length": 242.0,
+      "completions/mean_length": 234.3125,
+      "completions/mean_terminated_length": 117.20000457763672,
+      "completions/min_length": 2.0,
+      "completions/min_terminated_length": 2.0,
+      "epoch": 0.02888086642599278,
+      "grad_norm": 0.19052405655384064,
+      "kl": 0.0019435517024248838,
+      "learning_rate": 8.213938048432697e-05,
+      "loss": 0.1222,
+      "num_tokens": 68921.0,
+      "reward": -3151.4111328125,
+      "reward_std": 903.66650390625,
+      "rewards/reward_short_completions/mean": -1052.5,
+      "rewards/reward_short_completions/std": 332.7859802246094,
+      "step": 8
     }
   ],
   "logging_steps": 1,
   "max_steps": 20,
-  "num_input_tokens_seen": 34300,
+  "num_input_tokens_seen": 68921,
   "num_train_epochs": 1,
   "save_steps": 4,
   "stateful_callbacks": {
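
The log entries for steps 5 to 8 can be cross-checked with a little arithmetic. The sketch below hard-codes values copied from the diff and tests two assumptions that the source does not state: that the learning rate follows a cosine decay from a peak of 1e-4 after 2 warmup steps over the 20 `max_steps`, and that each step logs 32 completions capped at 256 tokens. The constants `PEAK_LR`, `WARMUP_STEPS`, and `NUM_COMPLETIONS`, and the one-step offset in `cosine_lr`, were chosen to fit the logged numbers and are not taken from the training configuration.

```python
import math

MAX_STEPS = 20        # "max_steps" in trainer_state.json
MAX_LEN = 256         # "completions/max_length" in the log entries
PEAK_LR = 1e-4        # assumption: not shown in the diff
WARMUP_STEPS = 2      # assumption: not shown in the diff
NUM_COMPLETIONS = 32  # assumption: inferred from clipped_ratio being k/32

# (step, learning_rate, clipped_ratio, mean_length, mean_terminated_length)
logs = [
    (5, 9.698463103929542e-05, 1.0, 256.0, 0.0),
    (6, 9.330127018922194e-05, 0.8125, 234.09375, 139.1666717529297),
    (7, 8.83022221559489e-05, 0.84375, 230.5, 92.80000305175781),
    (8, 8.213938048432697e-05, 0.84375, 234.3125, 117.20000457763672),
]


def cosine_lr(step: int) -> float:
    """Cosine decay, assuming the value logged at `step` reflects `step - 1` scheduler updates."""
    progress = (step - 1 - WARMUP_STEPS) / (MAX_STEPS - WARMUP_STEPS)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


def expected_mean_length(clipped_ratio: float, mean_terminated: float) -> float:
    """Average of clipped completions (at MAX_LEN) and terminated ones (at their mean length)."""
    clipped = round(clipped_ratio * NUM_COMPLETIONS)
    terminated = NUM_COMPLETIONS - clipped
    return (clipped * MAX_LEN + terminated * mean_terminated) / NUM_COMPLETIONS


for step, lr, clipped_ratio, mean_len, mean_term in logs:
    lr_ok = math.isclose(cosine_lr(step), lr, rel_tol=1e-9)
    len_ok = math.isclose(
        expected_mean_length(clipped_ratio, mean_term), mean_len, rel_tol=1e-6
    )
    print(f"step {step}: lr matches={lr_ok}, mean_length matches={len_ok}")
```

If both checks pass, the logged values are consistent with those assumptions, but that does not prove the run was configured this way. The length check also shows what the `completions/clipped_ratio` values mean in practice: between 81% and 100% of completions hit the 256-token cap in these steps.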