2
2
3
3
### LibriSpeech BPE training results (Pruned Transducer 2)
4
4
5
+ [ pruned_transducer_stateless2] ( ./pruned_transducer_stateless2 )
5
6
This is with a reworked version of the conformer encoder, with many changes.
6
7
7
- [ pruned_transducer_stateless2 ] ( ./pruned_transducer_stateless2 )
8
+ #### Training on fulll librispeech
8
9
9
10
using commit ` 34aad74a2c849542dd5f6359c9e6b527e8782fd6 ` .
10
11
See < https://github.com/k2-fsa/icefall/pull/288 >
@@ -33,9 +34,9 @@ The Tensorboard log is at <https://tensorboard.dev/experiment/Xoz0oABMTWewo1slNF
33
34
only from epoch 3).
34
35
35
36
36
- The WERs for librispeech 100 hours are :
37
+ #### Training on train-clean- 100:
37
38
38
- Trained with one job:
39
+ Trained with 1 job:
39
40
` python3 ./pruned_transducer_stateless2/train.py --exp-dir=pruned_transducer_stateless2/exp_100h_ws1 --world-size 1 --num-epochs 40 --full-libri 0 --max-duration 300 `
40
41
and decoded with:
41
42
` python3 ./pruned_transducer_stateless2/decode.py --exp-dir pruned_transducer_stateless2/exp_100h_ws1 --epoch 19 --avg 8 --bpe-model ./data/lang_bpe_500/bpe.model --max-duration 600 ` .
@@ -51,7 +52,7 @@ schedule is not visible due to a since-fixed bug).
51
52
| fast beam search | 6.58 | 17.27 | --epoch 29 --avg 8 --decoding-method fast_beam_search |
52
53
| fast beam search | 6.53 | 16.82 | --epoch 39 --avg 10 --decoding-method fast_beam_search |
53
54
54
- Trained with two jobs:
55
+ Trained with 2 jobs:
55
56
` python3 ./pruned_transducer_stateless2/train.py --exp-dir=pruned_transducer_stateless2/exp_100h_ws2 --world-size 2 --num-epochs 40 --full-libri 0 --max-duration 300 `
56
57
and decoded with:
57
58
` python3 ./pruned_transducer_stateless2/decode.py --exp-dir pruned_transducer_stateless2/exp_100h_ws2 --epoch 19 --avg 8 --bpe-model ./data/lang_bpe_500/bpe.model --max-duration 600 ` .
@@ -82,6 +83,21 @@ The Tensorboard log is at <https://tensorboard.dev/experiment/a3T0TyC0R5aLj5bmFb
82
83
| greedy search (max sym per frame 1) | 6.86 | 18.29 | --epoch 30 --avg 10 |
83
84
84
85
86
+
87
+ Trained with 1 job, with --use-fp16=True --max-duration=300 i.e. with half-precision
88
+ floats (but without increasing max-duration), after merging < https://github.com/k2-fsa/icefall/pull/305 > .
89
+ Train command was
90
+ ` python3 ./pruned_transducer_stateless2/train.py --exp-dir=pruned_transducer_stateless2/exp_100h_fp16 --world-size 1 --num-epochs 40 --full-libri 0 --max-duration 300 --use-fp16 True `
91
+
92
+ The Tensorboard log is at < https://tensorboard.dev/experiment/DAtGG9lpQJCROUDwPNxwpA >
93
+
94
+ | | test-clean | test-other | comment |
95
+ | -------------------------------------| ------------| ------------| -----------------------|
96
+ | greedy search (max sym per frame 1) | 7.10 | 18.57 | --epoch 19 --avg 8 |
97
+ | greedy search (max sym per frame 1) | 6.81 | 17.84 | --epoch 29 --avg 8 |
98
+ | greedy search (max sym per frame 1) | 6.63 | 17.39 | --epoch 30 --avg 10 |
99
+
100
+
85
101
Trained with 1 job, with --use-fp16=True --max-duration=500, i.e. with half-precision
86
102
floats and max-duration increased from 300 to 500, after merging < https://github.com/k2-fsa/icefall/pull/305 > .
87
103
Train command was
0 commit comments