NLU comparison does not plot results with more than one run #5426

TEGELB · 2020-03-16T16:30:12Z

Rasa version: 1.8.1

Python version: 3.7.5

Operating system (windows, osx, ...): Windows

Issue:
When I run the comparison of different pipeline configurations from a folder with default settings, as laid out in the documentation, the diagram is not plotted and an error occurs.
The error occurs when comparing two configurations as well as when comparing six.
All files are saved correctly up to that point.

No error occurs when the number of runs is manually set to one.

Error (including full traceback):

![grafik](https://user-images.githubusercontent.com/56721736/76779152-2ec5b980-67ab-11ea-92fb-73765fbdaefc.png)

Command or request that led to error:

rasa test nlu

chkoss · 2020-03-16T18:00:06Z

I have already traced what happens when running, e.g., rasa test nlu --runs 2.

Here, number_of_examples is plotted vs. mean, where mean is an average over runs of what's in the file output:
https://github.com/RasaHQ/rasa/blob/master/rasa/test.py#L219

The problem is, the shapes don't match. Let’s call the value of --runs r and the number of percentages in --percentages p. Then number_of_examples has shape (r*p,), the content of output has shape (r, p) and mean therefore has shape (p,).

This is caused by how number_of_examples is determined in compare_nlu:
https://github.com/RasaHQ/rasa/blob/master/rasa/nlu/test.py#L1413
This function simply appends the values for every run and every percentage to a single 1D array of shape (r*p,), so its length no longer matches the length p of mean.
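To make the mismatch concrete, here is a minimal pure-Python sketch of the shapes described above, together with one possible way to line them up again. All numbers and the helper `average_examples_per_percentage` are made up for illustration; this is not the code in rasa/test.py and not necessarily how the fix in #5568 works.

```python
from statistics import mean

runs = 2                     # value of --runs (r)
percentages = [0, 25, 50]    # hypothetical --percentages values (p = 3)

# Hypothetical training-set sizes. One entry is appended per run and
# percentage, so the list is flat with length r * p.
number_of_examples = [100, 75, 50, 101, 74, 51]

# Hypothetical scores per run: r rows with p values each.
output = [
    [0.90, 0.85, 0.80],
    [0.92, 0.83, 0.78],
]

# Averaging over runs collapses `output` to a list of length p.
mean_scores = [mean(column) for column in zip(*output)]

# len(number_of_examples) is r * p, len(mean_scores) is p:
# plotting one against the other fails as soon as r > 1.
assert len(number_of_examples) != len(mean_scores)


def average_examples_per_percentage(flat_counts, n_runs):
    """Regroup the flat counts into one row per run, then average per percentage.

    Hypothetical helper, assuming the counts were appended run by run.
    """
    n_points = len(flat_counts) // n_runs
    per_run = [
        flat_counts[i * n_points:(i + 1) * n_points] for i in range(n_runs)
    ]
    return [mean(column) for column in zip(*per_run)]


x_values = average_examples_per_percentage(number_of_examples, runs)
print(len(x_values), len(mean_scores))  # both lists now have length p
```

With r = 1 the flat list already has length p, which is consistent with the observation that the plot works when the number of runs is set to one.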

indam23 · 2020-03-20T18:19:23Z

Just ran into this too - it works in 1.7, in case that helps

m-vdb · 2020-04-03T09:19:07Z

I was able to reproduce the issue using the doc here: https://rasa.com/docs/rasa/user-guide/testing-your-assistant/#comparing-nlu-pipelines and the right configuration files.

TEGELB added the type:bug 🐛 Inconsistencies or issues which will cause an issue or problem for users or implementors. label Mar 16, 2020

tabergma added the area:rasa-oss 🎡 Anything related to the open source Rasa framework label Mar 18, 2020

alwx added the priority:normal label Mar 20, 2020

wochinge assigned chkoss Mar 23, 2020

m-vdb assigned m-vdb and unassigned chkoss Apr 3, 2020

m-vdb mentioned this issue Apr 3, 2020

rasa test nlu: use the correct number of examples #5568

Merged

2 tasks

m-vdb linked a pull request Apr 3, 2020 that will close this issue

chkoss closed this as completed Apr 7, 2020
