qwen3-8b-sid-pet-1ep-seed42 / training_meta.json
Commit 9f41749 (verified) by kalistratov: Upload folder using huggingface_hub
{
"stage1": "/workspace/stage2/output/snapshots/step-4000",
"vocab_size": 152696,
"epochs": 3,
"lr": 1.5e-06,
"eff_batch": 128,
"max_seq_length": 320,
"train_samples": 619404,
"packing": true,
"general_fraction": 0.0,
"final_loss": 1.6875167810435534,
"runtime_sec": 4950.2604
}
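The fields above are enough to derive a few rough run statistics. The sketch below is illustrative only and rests on assumptions the file does not confirm: that `train_samples` is a per-epoch count, that `eff_batch` is the number of samples per optimizer step, and that `final_loss` is a mean cross-entropy in nats. The helper name `summarize` is hypothetical. With `packing` enabled, the real number of sequences per step may differ, so treat the results as estimates.

```python
import json
import math

# Copy of the metadata shown above, embedded so the sketch is self-contained.
META_JSON = """
{
  "stage1": "/workspace/stage2/output/snapshots/step-4000",
  "vocab_size": 152696,
  "epochs": 3,
  "lr": 1.5e-06,
  "eff_batch": 128,
  "max_seq_length": 320,
  "train_samples": 619404,
  "packing": true,
  "general_fraction": 0.0,
  "final_loss": 1.6875167810435534,
  "runtime_sec": 4950.2604
}
"""


def summarize(meta):
    """Derive rough statistics from the metadata (hypothetical helper)."""
    # Assumption: train_samples is counted once per epoch.
    total_samples = meta["train_samples"] * meta["epochs"]
    # Assumption: one optimizer step consumes eff_batch samples.
    optimizer_steps = math.ceil(total_samples / meta["eff_batch"])
    return {
        "total_samples": total_samples,
        "optimizer_steps": optimizer_steps,
        "samples_per_sec": total_samples / meta["runtime_sec"],
        # Assumption: final_loss is mean cross-entropy in nats.
        "perplexity": math.exp(meta["final_loss"]),
        # Upper bound on tokens per step if every sequence is full length.
        "max_tokens_per_step": meta["eff_batch"] * meta["max_seq_length"],
    }


if __name__ == "__main__":
    stats = summarize(json.loads(META_JSON))
    for name, value in stats.items():
        print(f"{name}: {value}")
```

If `train_samples` already covers all epochs, drop the multiplication by `epochs`; the step count and throughput shrink accordingly.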