NetherQuartz/ilo-toki-gemmax2-28-2b-mono-en

Generated from Trainer

80.2 MB

1 contributor

History: 2 commits

Try lr 6e-5 with 3 epochs only English

0709194 verified 8 months ago

.gitattributes

1.57 kB
Try lr 6e-5 with 3 epochs only English 8 months ago
README.md

1.53 kB
Try lr 6e-5 with 3 epochs only English 8 months ago
adapter_config.json

979 Bytes
Try lr 6e-5 with 3 epochs only English 8 months ago
adapter_model.safetensors

41.6 MB
xet

Try lr 6e-5 with 3 epochs only English 8 months ago
chat_template.jinja

356 Bytes
Try lr 6e-5 with 3 epochs only English 8 months ago
special_tokens_map.json

585 Bytes
Try lr 6e-5 with 3 epochs only English 8 months ago
tokenizer.json

34.4 MB
xet

Try lr 6e-5 with 3 epochs only English 8 months ago
tokenizer.model

4.24 MB
xet

Try lr 6e-5 with 3 epochs only English 8 months ago
tokenizer_config.json

48.4 kB
Try lr 6e-5 with 3 epochs only English 8 months ago
training_args.bin
Detected Pickle imports (10)
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.SchedulerType",
- "transformers.training_args.OptimizerNames",
- "accelerate.state.PartialState",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.trainer_utils.HubStrategy",
- "trl.trainer.sft_config.SFTConfig",
- "torch.device"
6.16 kB
xet

Try lr 6e-5 with 3 epochs only English 8 months ago
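
The pickle imports listed for training_args.bin can be reproduced locally without unpickling the file, so no code from the file is executed. The following is a minimal sketch, not the Hub's actual scanner: the function names `list_pickle_imports` and `list_checkpoint_imports` are hypothetical, and it assumes the file is either a raw pickle stream or a zip archive with `.pkl` members (the layout recent `torch.save` versions produce). The `STACK_GLOBAL` handling is a heuristic that pairs the two most recently seen string arguments and can miss names that are fetched from the pickle memo.

```python
import collections
import pickle
import pickletools
import zipfile


def list_pickle_imports(data: bytes) -> set:
    """Collect 'module.name' globals referenced by a pickle stream.

    Only opcodes are read via pickletools.genops; nothing is unpickled.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # genops yields the argument as "module name" (space separated)
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Heuristic: module and name are the last two strings pushed
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


def list_checkpoint_imports(path: str) -> set:
    """Scan a raw pickle file, or every .pkl member of a zip checkpoint."""
    if zipfile.is_zipfile(path):
        found = set()
        with zipfile.ZipFile(path) as zf:
            for member in zf.namelist():
                if member.endswith(".pkl"):
                    found |= list_pickle_imports(zf.read(member))
        return found
    with open(path, "rb") as f:
        return list_pickle_imports(f.read())


# Small in-memory demonstration on a harmless object
demo = pickle.dumps(collections.OrderedDict(), protocol=4)
print(sorted(list_pickle_imports(demo)))
```

Calling `list_checkpoint_imports("training_args.bin")` on a downloaded copy should produce a set comparable to the list above, though the exact names depend on the library versions that wrote the file.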