2026-05-07 13:27:47,763 | INFO | Starting SFT fine-tuning job
2026-05-07 13:27:47,763 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:27:47,763 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:27:47,763 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:27:47,763 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:27:47,763 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:27:47,804 | INFO | CUDA available: True
2026-05-07 13:27:47,825 | INFO | GPU: NVIDIA GB10
2026-05-07 13:27:47,826 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:27:47,827 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:27:47,827 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:27:47,827 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:27:49,034 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:30:30,504 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:30:34,280 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:30:40,321 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:30:40,717 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:30:40,818 | INFO | Train rows: 25340
2026-05-07 13:30:40,819 | INFO | Eval rows: 6336
2026-05-07 13:30:40,819 | INFO | Sample train example: [{'role': 'system', 'content': 'أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.'}, {'role': 'user', 'content': 'نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟'}, {'role': 'assistant', 'content': 'البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.'}]
2026-05-07 13:30:40,819 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:34:03,570 | INFO | Starting SFT fine-tuning job
2026-05-07 13:34:03,570 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:34:03,570 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:34:03,570 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:34:03,570 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:34:03,570 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:34:03,612 | INFO | CUDA available: True
2026-05-07 13:34:03,631 | INFO | GPU: NVIDIA GB10
2026-05-07 13:34:03,633 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:34:03,633 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:34:03,633 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:34:03,633 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:34:04,862 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:36:22,953 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:36:26,760 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:36:28,674 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:36:28,675 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:36:28,676 | INFO | Train rows: 25340
2026-05-07 13:36:28,676 | INFO | Eval rows: 6336
2026-05-07 13:36:28,677 | INFO | Sample train example: [{'role': 'system', 'content': 'أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.'}, {'role': 'user', 'content': 'نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟'}, {'role': 'assistant', 'content': 'البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.'}]
2026-05-07 13:36:28,677 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:41:24,497 | INFO | Starting SFT fine-tuning job
2026-05-07 13:41:24,497 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:41:24,497 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:41:24,497 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:41:24,497 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:41:24,497 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:41:24,548 | INFO | CUDA available: True
2026-05-07 13:41:24,576 | INFO | GPU: NVIDIA GB10
2026-05-07 13:41:24,577 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:41:24,578 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:41:24,578 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:41:24,578 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:41:25,691 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:44:02,778 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:44:06,613 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:44:08,694 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:44:08,697 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:44:08,698 | INFO | Train rows: 25340
2026-05-07 13:44:08,699 | INFO | Eval rows: 6336
2026-05-07 13:44:08,699 | INFO | Sample train example: [{'role': 'system', 'content': 'أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.'}, {'role': 'user', 'content': 'نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟'}, {'role': 'assistant', 'content': 'البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.'}]
2026-05-07 13:44:08,699 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:44:08,752 | INFO | Training started
2026-05-07 13:46:02,478 | INFO | Starting SFT fine-tuning job
2026-05-07 13:46:02,479 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:46:02,479 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:46:02,479 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:46:02,479 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:46:02,479 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:46:02,523 | INFO | CUDA available: True
2026-05-07 13:46:02,552 | INFO | GPU: NVIDIA GB10
2026-05-07 13:46:02,554 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:46:02,555 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:46:02,555 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:46:02,555 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:46:03,698 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:48:19,475 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:48:23,292 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:48:25,199 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:48:25,202 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:48:25,203 | INFO | Train rows: 25340
2026-05-07 13:48:25,203 | INFO | Eval rows: 6336
2026-05-07 13:48:25,204 | INFO | Sample train example: [{'role': 'system', 'content': 'أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.'}, {'role': 'user', 'content': 'نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟'}, {'role': 'assistant', 'content': 'البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.'}]
2026-05-07 13:48:25,204 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:48:25,244 | INFO | Training started
2026-05-07 13:55:23,761 | INFO | Starting SFT fine-tuning job
2026-05-07 13:55:23,761 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:55:23,761 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:55:23,761 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:55:23,761 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:55:23,761 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:55:23,803 | INFO | CUDA available: True
2026-05-07 13:55:23,826 | INFO | GPU: NVIDIA GB10
2026-05-07 13:55:23,828 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:55:23,829 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:55:23,829 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:55:23,829 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:55:24,929 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:58:01,693 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:58:05,420 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:58:07,816 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:58:08,954 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:58:09,429 | INFO | Train rows: 25340
2026-05-07 13:58:09,430 | INFO | Eval rows: 6336
2026-05-07 13:58:09,430 | INFO | Sample train example text: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-07 13:58:09,430 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:59:25,638 | INFO | Starting SFT fine-tuning job
2026-05-07 13:59:25,639 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:59:25,639 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:59:25,639 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:59:25,639 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:59:25,639 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:59:25,683 | INFO | CUDA available: True
2026-05-07 13:59:25,709 | INFO | GPU: NVIDIA GB10
2026-05-07 13:59:25,710 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:59:25,711 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:59:25,711 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:59:25,711 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:59:26,819 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 14:02:05,419 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 14:02:09,265 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 14:02:11,637 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:02:11,918 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:02:12,159 | INFO | Train rows: 25340
2026-05-07 14:02:12,160 | INFO | Eval rows: 6336
2026-05-07 14:02:12,160 | INFO | Sample train example text: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-07 14:02:12,161 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 14:02:23,459 | INFO | Training started
2026-05-07 14:08:07,243 | INFO | Starting SFT fine-tuning job
2026-05-07 14:08:07,244 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 14:08:07,244 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 14:08:07,244 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 14:08:07,244 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 14:08:07,244 | INFO | Torch version: 2.11.0+cu130
2026-05-07 14:08:07,286 | INFO | CUDA available: True
2026-05-07 14:08:07,309 | INFO | GPU: NVIDIA GB10
2026-05-07 14:08:07,499 | INFO | Hugging Face login succeeded
2026-05-07 14:08:07,500 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 14:08:07,500 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 14:08:07,500 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 14:08:08,618 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 14:10:19,279 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 14:10:23,076 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 14:10:25,353 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:10:25,621 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:10:25,861 | INFO | Train rows: 25340
2026-05-07 14:10:25,862 | INFO | Eval rows: 6336
2026-05-07 14:10:25,862 | INFO | Sample train example text: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>نحس بوجيعة في العينين من كثرة غسلان الماعون بالماء السخون؟<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>البخار الصاعد من الماء السخون ينجم يتعب العين ويسبب احمرار، هذاكا علاش حاول تستعمل ماء دافي مش سخون برشة. اغسل وجهك بماء بارد بعد ما تكمل باش تبرد عينيك.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-07 14:10:25,862 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 14:10:29,187 | INFO | Training started
2026-05-08 04:09:43,113 | INFO | Training finished
2026-05-08 04:09:43,113 | INFO | Saving model and tokenizer to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-08 04:09:45,243 | INFO | Saved training metrics to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft/training_metrics.json
2026-05-08 04:09:45,243 | INFO | Running preview generation on a Tunisian prompt
2026-05-08 04:09:48,487 | INFO | Preview prompt: عسلامة، شنوة تنصحني نعمل كي نكون تعبان وبرشة؟
2026-05-08 04:09:48,487 | INFO | Preview output: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>أنت "التيجاني"، مساعد ذكاء اصطناعي تونسي 100%. جاوب بالتونسي الدارج فقط، وبالطول المناسب للسؤال: كان يلزم قصّر، وكان يلزم فسّر أكثر. ممنوع الهلوسة أو الخروج على الموضوع.<|START_OF_TURN_TOKEN|><|USER_TOKEN|>عسلامة، شنوة تنصحني نعمل كي نكون تعبان وبرشة؟<|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ح في في في في في في في في في في<|START_OF_TURN_TOKEN|>