Kaizendsds commited on
Commit
8141c51
·
verified ·
1 Parent(s): f83ab73

End of training

Browse files
Files changed (2) hide show
  1. README.md +30 -15
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,4 +1,5 @@
1
  ---
 
2
  license: mit
3
  base_model: microsoft/speecht5_tts
4
  tags:
@@ -15,12 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - eval_loss: 0.3686
19
- - eval_runtime: 7.6727
20
- - eval_samples_per_second: 21.374
21
- - eval_steps_per_second: 1.434
22
- - epoch: 69.5652
23
- - step: 3200
24
 
25
  ## Model description
26
 
@@ -39,18 +35,37 @@ More information needed
39
  ### Training hyperparameters
40
 
41
  The following hyperparameters were used during training:
42
- - learning_rate: 5e-05
43
- - train_batch_size: 32
44
- - eval_batch_size: 16
45
  - seed: 42
46
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 
 
47
  - lr_scheduler_type: linear
48
- - lr_scheduler_warmup_steps: 400
49
- - training_steps: 4000
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
50
 
51
  ### Framework versions
52
 
53
- - Transformers 4.40.0
54
- - Pytorch 2.4.1+cu124
55
  - Datasets 2.19.0
56
- - Tokenizers 0.19.1
 
1
  ---
2
+ library_name: transformers
3
  license: mit
4
  base_model: microsoft/speecht5_tts
5
  tags:
 
16
 
17
  This model is a fine-tuned version of [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.3886
 
 
 
 
 
20
 
21
  ## Model description
22
 
 
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
38
+ - learning_rate: 0.0001
39
+ - train_batch_size: 4
40
+ - eval_batch_size: 2
41
  - seed: 42
42
+ - gradient_accumulation_steps: 8
43
+ - total_train_batch_size: 32
44
+ - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
45
  - lr_scheduler_type: linear
46
+ - lr_scheduler_warmup_steps: 200
47
+ - training_steps: 2000
48
+ - mixed_precision_training: Native AMP
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss |
53
+ |:-------------:|:-------:|:----:|:---------------:|
54
+ | 0.5085 | 4.3360 | 200 | 0.4609 |
55
+ | 0.4697 | 8.6721 | 400 | 0.4265 |
56
+ | 0.4509 | 13.0081 | 600 | 0.4111 |
57
+ | 0.4289 | 17.3442 | 800 | 0.4057 |
58
+ | 0.4208 | 21.6802 | 1000 | 0.4049 |
59
+ | 0.4136 | 26.0163 | 1200 | 0.3990 |
60
+ | 0.4072 | 30.3523 | 1400 | 0.3980 |
61
+ | 0.4011 | 34.6883 | 1600 | 0.3920 |
62
+ | 0.3971 | 39.0244 | 1800 | 0.3905 |
63
+ | 0.3898 | 43.3604 | 2000 | 0.3886 |
64
+
65
 
66
  ### Framework versions
67
 
68
+ - Transformers 4.46.3
69
+ - Pytorch 2.10.0+cu128
70
  - Datasets 2.19.0
71
+ - Tokenizers 0.20.3
generation_config.json CHANGED
@@ -5,5 +5,5 @@
5
  "eos_token_id": 2,
6
  "max_length": 1876,
7
  "pad_token_id": 1,
8
- "transformers_version": "4.40.0"
9
  }
 
5
  "eos_token_id": 2,
6
  "max_length": 1876,
7
  "pad_token_id": 1,
8
+ "transformers_version": "4.46.3"
9
  }