Contrastive learning for unified models #2100

YYTtyy · 2023-11-02T04:39:21Z

Hi,
This PR implements the INTERSPEECH 2023 paper "Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning".
arXiv: https://arxiv.org/abs/2306.00755

Details:
Add joint training and a contrastive loss for unified models (in ctl_model/asr_model_ctl.py)
Add a pure full-context forward mode (in ctl_model/encoder.py)
Only return chunk sizes in the range 1~25 during training (in ctl_model/mask.py)
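
To make the first item concrete, here is a minimal, framework-free sketch of a frame-level contrastive (InfoNCE-style) loss combined with joint training. It is an illustration under stated assumptions, not the code in ctl_model/asr_model_ctl.py: it assumes frame t of the streaming encoder output is the positive for frame t of the full-context output and that all other frames act as negatives. The names `frame_contrastive_loss`, `joint_loss`, `temperature` and `ctl_weight`, and their default values, are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def frame_contrastive_loss(streaming, full_context, temperature=0.1):
    """InfoNCE over frames (assumed pairing, see lead-in).

    Frame t of the streaming output should be more similar to frame t of
    the full-context output than to any other full-context frame.
    Both inputs are lists of per-frame feature vectors of equal length.
    """
    assert len(streaming) == len(full_context)
    total = 0.0
    for t, anchor in enumerate(streaming):
        logits = [cosine(anchor, ref) / temperature for ref in full_context]
        # Numerically stable log-sum-exp for the softmax denominator.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(x - m) for x in logits))
        # Negative log-probability of the positive (same-index) frame.
        total += log_denom - logits[t]
    return total / len(streaming)


def joint_loss(asr_loss_streaming, asr_loss_full, ctl_loss, ctl_weight=0.1):
    """Hypothetical joint objective: both ASR losses plus a weighted
    contrastive term. The weighting scheme is an assumption."""
    return asr_loss_streaming + asr_loss_full + ctl_weight * ctl_loss
```

With two orthogonal frames, `frame_contrastive_loss(frames, frames)` is close to zero, while pairing the same frames in reversed order gives a large loss, which is the behaviour the contrastive term is meant to encourage. A real implementation would operate on batched PyTorch tensors rather than Python lists.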

The results on the AISHELL-1 dataset from the literature are as follows:

In addition, we ran experiments on an in-house corpus containing 25,000 hours of Mandarin speech. The results show that our method also yields consistent improvements on this larger dataset. The table below shows the results on the test set.

Models	Chunk=-1	Chunk=16
U2 model	17%	18.8%
Ours	16.7%	18.2%

xingchensong · 2023-11-02T11:31:00Z

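Thanks! A few small suggestions:

wenet/ctl_model/mask.py (left) does not seem to need any changes? Judging from the diff, it is essentially the same as wenet/utils/mask.py (right), just with one if branch removed.
wenet/ctl_model/encoder.py (left) could inherit directly from wenet/transformer/encoder.py (right): from the code, it effectively adds one member function, forward_full, to the BaseEncoder class and removes nothing. For an example of this inheritance pattern, see https://github.com/wenet-e2e/wenet/blob/main/wenet/paraformer/layers.py#L207-L298
Please rebase the code. The training pipeline was changed recently and most of the code in train.py has moved to train_utils.py, so the corresponding changes in this PR can be moved to train_utils as appropriate. See refactor(deepspeed): Refine traning code #2055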
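
The inheritance suggestion above can be sketched as follows. This is a toy illustration only: the `BaseEncoder` here is a simplified stand-in, not wenet's real class, and it returns attention masks instead of running a network. Only the name `forward_full` comes from the discussion; `CTLEncoder`, `attention_mask` and the constructor argument are hypothetical.

```python
from typing import List


class BaseEncoder:
    """Toy stand-in for an encoder base class (NOT wenet's real BaseEncoder)."""

    def __init__(self, static_chunk_size: int = 16):
        self.static_chunk_size = static_chunk_size

    def attention_mask(self, num_frames: int, chunk_size: int) -> List[List[bool]]:
        """mask[i][j] is True when frame i may attend to frame j.

        A chunk_size <= 0 means full context; otherwise frame i sees every
        frame up to the end of its own chunk.
        """
        if chunk_size <= 0:
            return [[True] * num_frames for _ in range(num_frames)]
        mask = []
        for i in range(num_frames):
            visible_end = min((i // chunk_size + 1) * chunk_size, num_frames)
            mask.append([j < visible_end for j in range(num_frames)])
        return mask

    def forward(self, num_frames: int) -> List[List[bool]]:
        """Chunk-based (streaming-style) pass."""
        return self.attention_mask(num_frames, self.static_chunk_size)


class CTLEncoder(BaseEncoder):
    """Adds only the extra full-context pass; everything else is inherited."""

    def forward_full(self, num_frames: int) -> List[List[bool]]:
        """Pure full-context pass, independent of the chunk configuration."""
        return self.attention_mask(num_frames, chunk_size=-1)
```

Because the subclass only adds a method, later fixes to the base class are picked up automatically, which is the maintenance benefit of inheriting instead of copying the file.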

YYTtyy · 2023-11-03T03:07:33Z

Sure, no problem! Thanks for the suggestions.

xingchensong · 2023-11-09T09:46:47Z

Great work! The PR is quite clear. Looking forward to the follow-up article!

kobenaxie · 2023-11-13T11:46:46Z

Are there training loss curves available for reference?

Contrastive learning for unified models

b94cb59

xingchensong requested review from robin1001, xingchensong, Mddct and whiteshirt0429 November 2, 2023 08:05

xingchensong added the enhancement New feature or request label Nov 2, 2023

robin1001 assigned whiteshirt0429 and xingchensong Nov 2, 2023

YYTtyy added 4 commits November 6, 2023 17:21

Merge remote-tracking branch 'upstream/main' into main

7512f67

Merge remote-tracking branch 'upstream/main' into main

fdd7ee8

Refine the code of contrastive learing for unified ASR models

1c956e7

Refine the code of contrastive learing for unified ASR models

ec0ae23

xingchensong approved these changes Nov 9, 2023

xingchensong merged commit a114e39 into wenet-e2e:main Nov 9, 2023
