Raghav-Singhal's picture
instruct+safety mix SFT (field=messages_cite, 10% safety = 30000 safety + 270000 instruct of 300000; instruct=jkminder/model-raising-pbsft-instruct-300k safety=jkminder/model-raising-pbsft-safety-180k, template=epe-template-nosys, tokenizer=/capstor/store/cscs/swissai/a141/model-raising-training/checkpoints/pretraining/smollm2-3b/hf/epe-1p-3b-llama3arch-smollm2tok-500B-40n-2048sl-960gbsz-no_bce) on normal-3b-llama3arch-smollm2tok-500B-40n-2048sl-960gbsz
ad51d96 verified