Add ctc_decode.py for the model trained with rnnt-loss and ctc-loss #12

yaozengwei · 2022-11-13T10:44:39Z

No description provided.

* Support running icefall outside of a git tracked directory. * Minor fixes.

* update RESULTS.md * fix test code in pruned_transducer_stateless5/conformer.py * minor fix * delete doc * fix style

* init files * use average value as memory vector for each chunk * change tail padding length from right_context_length to chunk_length * correct the files, ln -> cp * fix bug in conv_emformer_transducer_stateless2/emformer.py * fix doc in conv_emformer_transducer_stateless/emformer.py * refactor init states for stream * modify .flake8 * fix bug about memory mask when memory_size==0 * add @torch.jit.export for init_states function * update RESULTS.md * minor change * update README.md * modify doc * replace torch.div() with << * fix bug, >> -> << * use i&i-1 to judge if it is a power of 2 * minor fix * fix error in RESULTS.md

* update multi_quantization installation * Update egs/librispeech/ASR/pruned_transducer_stateless6/train.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* add aishell2 * fix aishell2 * add manifest stats * update prepare char dict * fix lint * setting max duration * lint * change context size to 1 * update result * update hf link * fix decoding comment * add more decoding methods * update result * change context-size 2 default

Fix diagnostic

* update conformer.py for aishell4 * update conformer.py * add strict=False when model.load_state_dict

…sformer decoder (k2-fsa#462) * ctc attention model with reworked conformer encoder and reworked transformer decoder * remove unnecessary func * resolve flake8 conflicts * fix typos and modify the expr of ScaledEmbedding * use original beam size * minor changes to the scripts * add rnn lm decoding * minor changes * check whether q k v weight is None * check whether q k v weight is None * check whether q k v weight is None * style correction * update results * update results * upload the decoding results of rnn-lm to the RESULTS * upload the decoding results of rnn-lm to the RESULTS * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update doc to add a link to Nadira Povey's YouTube channel. * fix a typo

* add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change

* Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py * refactor * modified beam search for stateless3,4 * Fix comments * Add real streamng ci

k2-fsa#494)

…#495) * Use aidatatang_200zh optionally in aishell training.

PR k2-fsa#495 introduces an error. This commit fixes it.

…ming) (k2-fsa#447) * pruned-rnnt5-for-wenetspeech * style check * style check * add streaming conformer * add streaming decode * changes codes for fast_beam_search and export cpu jit * add modified-beam-search for streaming decoding * add modified-beam-search for streaming decoding * change for streaming_beam_search.py * add README.md and RESULTS.md * change for style_check.yml * do some changes * do some changes for export.py * add some decode commands for usage * add streaming results on README.md

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* fix torchaudio version in dockerfile * remove kaldiio

* Add fast_beam_search_LG * add fast_beam_search_LG to commonly used recipes * fix ci * fix ci * Fix error

Support RNNLM shallow fusion in modified beam search

Remove testing file

* add delay penalty * fix CI * fix CI

* refactor getting timestamps for fsa-based decoding * fix doc * fix bug

yaozengwei and others added 30 commits April 29, 2022 10:26

Merge remote-tracking branch 'k2-fsa/master'

9c39d8b

Merge remote-tracking branch 'k2-fsa/master'

70634d5

Merge remote-tracking branch 'k2-fsa/master'

ecfb3e9

Merge remote-tracking branch 'k2-fsa/master'

bcef517

Merge remote-tracking branch 'k2-fsa/master'

c9d84ae

Merge remote-tracking branch 'k2-fsa/master'

fbbc24f

Merge remote-tracking branch 'origin/master'

5453166

Merge remote-tracking branch 'k2-fsa/master'

bb7ea31

Merge remote-tracking branch 'k2-fsa/master'

2a5a70e

Merge remote-tracking branch 'k2-fsa/master'

ec8646d

Merge remote-tracking branch 'k2-fsa/master'

74c14f5

Support running icefall outside of a git tracked directory. (k2-fsa#470)

6c69c4e

* Support running icefall outside of a git tracked directory. * Minor fixes.

Rand combine update result (k2-fsa#467)

ce26495

* update RESULTS.md * fix test code in pruned_transducer_stateless5/conformer.py * minor fix * delete doc * fix style

update multi_quantization installation (k2-fsa#469)

f8d28f0

* update multi_quantization installation * Update egs/librispeech/ASR/pruned_transducer_stateless6/train.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

[WIP] Rnn-T LM nbest rescoring (k2-fsa#471)

ffca1ae

add compile_lg.py for aishell2 recipe (k2-fsa#481)

aec222e

Add RNN-LM rescoring in fast beam search (k2-fsa#475)

608473b

fix for case of None stats

a35b28c

Merge pull request k2-fsa#483 from yaozengwei/fix_diagnostic

a8696b3

Fix diagnostic

Update conformer.py for aishell4 (k2-fsa#484)

3d2986b

* update conformer.py for aishell4 * update conformer.py * add strict=False when model.load_state_dict

Update doc to add a link to Nadira Povey's YouTube channel. (k2-fsa#492)

d997968

* Update doc to add a link to Nadira Povey's YouTube channel. * fix a typo

Add stats about duration and padding proportion (k2-fsa#485)

8203d10

* add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change

Add modified_beam_search for streaming decode (k2-fsa#489)

b1d0956

* Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py * refactor * modified beam search for stateless3,4 * Fix comments * Add real streamng ci

Fix using G before assignment in pruned_transducer_stateless/decode.py (

4612b03

k2-fsa#494)

Support using aidatatang_200zh optionally in aishell training (k2-fsa…

d3fc4b0

…#495) * Use aidatatang_200zh optionally in aishell training.

Fix get_transducer_model() for aishell. (k2-fsa#497)

385645d

PR k2-fsa#495 introduces an error. This commit fixes it.

marcoyang1998 and others added 29 commits November 2, 2022 17:24

update results

86662f0

update decoding commands

0a46a39

update author info

babcfd4

update

6c8d1f9

include previous added decoding method

9a01b90

minor fixes

fb45b95

remove redundant test lines

b62fd91

Update egs/librispeech/ASR/lstm_transducer_stateless2/decode.py

e3f218b

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

Update tdnn_lstm_ctc.rst (k2-fsa#647)

8f79f6d

Update README.md (k2-fsa#649)

04671b4

Update tdnn_lstm_ctc.rst (k2-fsa#648)

5d28562

fix torchaudio version in dockerfile (k2-fsa#653)

d2a1c65

* fix torchaudio version in dockerfile * remove kaldiio

update docs

2a52b8c

resolve conflicts

f45d9c4

Add fast_beam_search_LG (k2-fsa#622)

163d929

* Add fast_beam_search_LG * add fast_beam_search_LG to commonly used recipes * fix ci * fix ci * Fix error

Fix LG log file name (k2-fsa#657)

64aed2c

resolve conflict with timestamp feature

0df5972

resolve conflicts

bdaeaae

minor fixes

b3c61b8

resolve conflicts

a2d7095

Merge pull request k2-fsa#645 from marcoyang1998/master

7c50a01

Support RNNLM shallow fusion in modified beam search

remove testing file

2271c3d

Merge branch 'k2-fsa:master' into master

35b884b

Merge pull request k2-fsa#659 from marcoyang1998/master

65b85b7

Remove testing file

Apply delay penalty on transducer (k2-fsa#654)

3600ce1

* add delay penalty * fix CI * fix CI

Refactor getting timestamps in fsa-based decoding (k2-fsa#660)

32de276

* refactor getting timestamps for fsa-based decoding * fix doc * fix bug

Merge remote-tracking branch 'k2-fsa/master' into ctc-rnnt

4053099

add ctc_decode.py

befc1e2

fix doc

3b3c312

csukuangfj merged commit 89ce554 into csukuangfj:rnnt-with-ctc-loss Nov 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ctc_decode.py for the model trained with rnnt-loss and ctc-loss #12

Add ctc_decode.py for the model trained with rnnt-loss and ctc-loss #12

yaozengwei commented Nov 13, 2022

Add ctc_decode.py for the model trained with rnnt-loss and ctc-loss #12

Add ctc_decode.py for the model trained with rnnt-loss and ctc-loss #12

Conversation

yaozengwei commented Nov 13, 2022