xattn_streaming_Qwen3-4B / train_results.json
QQTang1223's picture
Upload model files: xattn_streaming_Qwen3-4B
b74b288 verified
{
"epoch": 0.5262234695667427,
"num_input_tokens_seen": 1233302178,
"train_loss": 13.568301763534546,
"train_runtime": 49956.6557,
"train_samples_per_second": 0.48,
"train_steps_per_second": 0.01
}