[wenet] Add the context decoding graph, which supports context biasin… #1931

kaixunhuang0 · 2023-08-01T10:11:59Z

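This PR adds a Python implementation of hot-word biasing based on a hot-word (context) graph. All tests use the pretrained models released by wenet. U-WER and B-WER denote the word error rate on words outside the hot-word list and on the hot words, respectively.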

Results on the organization-name test set of the WenetSpeech biasing subset, decoded with attention rescoring, before and after applying the context graph:

method	CER	U-CER	B-CER
baseline	1.62	1.55	1.83
context graph	1.43	1.54	1.05

Hot-word list size: 298; context score = 2.0
Dataset and hot-word list: https://github.com/thuhcsi/Contextual-Biasing-Dataset/
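
For readers unfamiliar with hot-word graphs, the following is a minimal sketch of the general idea, not the code added in this PR. It assumes a trie over hot-word token sequences in which every matched token earns `context_score`, and the bonus collected for an unfinished match is taken back when the match breaks. The names `SimpleContextGraph`, `step` and `score_sequence` are hypothetical.

```python
from typing import Dict, List, Tuple


class SimpleContextGraph:
    """Toy hot-word trie (hypothetical; not the class from this PR)."""

    def __init__(self, hotwords: List[List[str]], context_score: float = 2.0):
        self.context_score = context_score
        # State 0 is the root; each state maps a token to the next state.
        self.transitions: List[Dict[str, int]] = [{}]
        self.is_final: List[bool] = [False]
        for tokens in hotwords:
            state = 0
            for tok in tokens:
                if tok not in self.transitions[state]:
                    self.transitions.append({})
                    self.is_final.append(False)
                    self.transitions[state][tok] = len(self.transitions) - 1
                state = self.transitions[state][tok]
            self.is_final[state] = True

    def step(self, state: int, partial: float,
             token: str) -> Tuple[int, float, float]:
        """Advance by one token.

        Returns (next_state, next_partial, delta). `delta` is the score
        change for the hypothesis; `partial` is the bonus currently held
        for a match that has not reached the end of a hot word yet.
        """
        nxt = self.transitions[state].get(token)
        if nxt is not None:
            if self.is_final[nxt]:
                # A hot word is complete: the bonus earned so far is kept.
                return nxt, 0.0, self.context_score
            return nxt, partial + self.context_score, self.context_score
        if state != 0:
            # The match broke: cancel the unfinished bonus, retry from root.
            new_state, new_partial, delta = self.step(0, 0.0, token)
            return new_state, new_partial, delta - partial
        return 0, 0.0, 0.0


def score_sequence(graph: SimpleContextGraph, tokens: List[str]) -> float:
    """Total context bonus for a token sequence."""
    state, partial, total = 0, 0.0, 0.0
    for tok in tokens:
        state, partial, delta = graph.step(state, partial, tok)
        total += delta
    # Cancel any match still unfinished at the end of the sequence.
    return total - partial
```

With `context_score = 2.0`, a fully matched three-token hot word adds 6.0 to a hypothesis, while a sequence that only matches its first two tokens ends up with no bonus.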

Results on the Librispeech test-other set, decoded with attention rescoring, before and after applying the context graph:

method	WER	U-WER	B-WER
baseline	8.77	5.58	36.84
context graph	7.9	5.61	28.02

Hot-word list size: 3838, obtained by merging the hot words contained in each test-other utterance; context score = 2.0
Hot-word list: https://github.com/facebookresearch/fbai-speech/tree/main/is21_deep_bias
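
Merging per-utterance hot words into one list can be sketched as below. This is only an illustration: how the lists were extracted from the dataset files is not shown in this PR, so the sketch assumes they are already parsed into Python lists, and `merge_hotwords` is a hypothetical helper name.

```python
from typing import Dict, Iterable, List


def merge_hotwords(per_utterance: Iterable[Iterable[str]]) -> List[str]:
    """Union of per-utterance hot-word lists, deduplicated, first-seen order."""
    seen: Dict[str, None] = {}
    for words in per_utterance:
        for word in words:
            word = word.strip()
            if word:  # skip empty entries
                seen.setdefault(word, None)
    return list(seen)
```

A single merged list means one context graph can be built once and reused for every utterance in the test set.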

…g during the ctc_prefix_beam_search and attention_rescoring

robin1001 · 2023-08-01T12:38:10Z

wenet/transformer/asr_model.py

@@ -433,6 +434,101 @@ def _ctc_prefix_beam_search(
        hyps = [(y[0], log_add([y[1][0], y[1][1]])) for y in cur_hyps]
        return hyps, encoder_out

+    def _ctc_prefix_beam_search_with_bias(


Could we add context_graph directly in _ctc_prefix_beam_search?

Yes, I have integrated the context_graph directly into _ctc_prefix_beam_search in the new commit.

robin1001 · 2023-08-01T12:41:23Z

@pengzhendong please follow the PR.

robin1001 · 2023-08-02T01:18:49Z

wenet/transformer/asr_model.py

-                               reverse=True)
-            cur_hyps = next_hyps[:beam_size]
-        hyps = [(y[0], log_add([y[1][0], y[1][1]])) for y in cur_hyps]
+        if context_graph is None:


What if we refer to https://github.com/wenet-e2e/wenet/blob/main/runtime/core/decoder/ctc_prefix_beam_search.cc#L168 for how to combine the logic with and without context?
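
One way to avoid separate branches for the two cases is a single scoring path in which the context bonus is applied only when a context object is present. The sketch below illustrates that pattern on a toy beam search without CTC blank handling; it is neither the Python code in this PR nor the referenced C++ code. The `context_step` callback and its `(state, token) -> (next_state, bonus)` signature are assumptions made for this example.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical callback: (context_state, token) -> (next_state, bonus).
ContextStep = Callable[[int, str], Tuple[int, float]]


def beam_search(
    frames: List[Dict[str, float]],
    beam_size: int,
    context_step: Optional[ContextStep] = None,
) -> List[Tuple[Tuple[str, ...], float]]:
    """Toy beam search over per-step token log-probs (no CTC blanks)."""
    # Each hypothesis: (tokens, model_score, context_state, context_bonus).
    hyps = [((), 0.0, 0, 0.0)]
    for frame in frames:
        candidates = []
        for tokens, score, state, bonus in hyps:
            for tok, logp in frame.items():
                new_state, new_bonus = state, bonus
                if context_step is not None:
                    # Only this small branch differs between the two modes.
                    new_state, delta = context_step(state, tok)
                    new_bonus += delta
                candidates.append(
                    (tokens + (tok,), score + logp, new_state, new_bonus))
        # Ranking always uses model score plus context bonus (0.0 if unused).
        candidates.sort(key=lambda h: h[1] + h[3], reverse=True)
        hyps = candidates[:beam_size]
    return [(h[0], h[1] + h[3]) for h in hyps]
```

Because the bonus stays at 0.0 when `context_step` is `None`, the sorting and pruning code is identical in both modes.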

robin1001 · 2023-08-03T01:46:32Z

Great job!

dahu1 · 2023-11-27T08:47:51Z

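@kaixunhuang0 Hi Kaixun, how did you obtain the 3838 hot words here? The largest list I can find here is the ref with 2000 entries. Did you build a new ref with 3838 hot words? (My construction: column 4 of the ref is a superset of the hot words in column 3, and I draw 3838 entries for column 4 from all_rare_words.txt.)

My questions:

Is my way of constructing the 3838 hot words correct?

Why not test with the ready-made 2000-entry list?

I measured the baseline WER with the 2000-entry list; could you check whether the result is valid?

Oh, I found the 3838-word list; it is in the bias_model you uploaded.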

kaixunhuang0 · 2023-11-27T08:53:48Z

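The 3838 hot words are what I got by merging all the true hot words in test-other. The 2000-entry ref here is built per utterance and contains many distractors. To use that 2000-entry list, you need to change the code so that a context graph is built for each utterance. If you did not, you most likely used only the first utterance's hot-word list, which gives no biasing at all for the later utterances; indeed, the overall WER in your results barely differs from the baseline. As for your U-WER and B-WER results, I suggest also measuring a run without hot words for comparison. Different people's U-WER/B-WER scripts seem to give somewhat different numbers, so comparing against a B-WER you measured yourself is safer.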
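
To make the U-WER/B-WER comparison concrete, here is a minimal sketch of one possible scoring convention; it is not the script used for the tables in this PR. It assumes single-word hot words, attributes substitutions and deletions to the reference word, and attributes an insertion to B-WER only if the inserted word is in the hot-word list. The function names `align` and `biased_wer` are hypothetical.

```python
from typing import List, Set, Tuple


def align(ref: List[str], hyp: List[str]) -> List[Tuple[str, str]]:
    """Levenshtein alignment as (ref_word, hyp_word) pairs; '' marks a gap."""
    n, m = len(ref), len(hyp)
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = dist[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            dist[i][j] = min(sub, dist[i - 1][j] + 1, dist[i][j - 1] + 1)
    # Trace back from the bottom-right corner to recover the alignment.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                dist[i][j] == dist[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])):
            pairs.append((ref[i - 1], hyp[j - 1]))  # match or substitution
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            pairs.append((ref[i - 1], ""))  # deletion
            i -= 1
        else:
            pairs.append(("", hyp[j - 1]))  # insertion
            j -= 1
    return pairs[::-1]


def biased_wer(ref: List[str], hyp: List[str],
               hotwords: Set[str]) -> Tuple[float, float]:
    """Return (U-WER, B-WER) in percent for one utterance."""
    u_err = b_err = u_ref = b_ref = 0
    for r, h in align(ref, hyp):
        if r:  # match, substitution or deletion: judged by the ref word
            biased = r in hotwords
            if biased:
                b_ref += 1
            else:
                u_ref += 1
            is_error = r != h
        else:  # insertion: judged by the inserted word
            biased = h in hotwords
            is_error = True
        if is_error:
            if biased:
                b_err += 1
            else:
                u_err += 1
    # max(..., 1) avoids division by zero when a class has no ref words.
    return 100.0 * u_err / max(u_ref, 1), 100.0 * b_err / max(b_ref, 1)
```

Running the same function on both the baseline and the biased hypotheses keeps the comparison consistent even if its conventions differ from other scripts.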

dahu1 · 2023-11-27T08:56:22Z

OK, got it.

kaixunhuang0 added 2 commits August 1, 2023 16:47

[wenet] Add the context decoding graph, which supports context biasin…

bc22f7c

…g during the ctc_prefix_beam_search and attention_rescoring

Modify the code format

6e18c0d

xingchensong requested review from robin1001, placebokkk and xingchensong August 1, 2023 12:28

robin1001 reviewed Aug 1, 2023

robin1001 requested review from robin1001 and pengzhendong August 1, 2023 12:40

xingchensong requested a review from whiteshirt0429 August 1, 2023 12:49

Directly integrate the context_graph into _ctc_prefix_beam_search

3d18735

robin1001 reviewed Aug 2, 2023

Modify logic with/without context

64623d4

robin1001 approved these changes Aug 3, 2023

robin1001 merged commit 9df6577 into wenet-e2e:main Aug 3, 2023
