Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarify num of docs in corpora when action and metadata is used #1167

Merged

Conversation

dliappis
Copy link
Contributor

@dliappis dliappis commented Feb 3, 2021

In the typical cases of corpora that doesn't include an action-and-meta
-data line, defining the number of documents is simple and equivalent
to the number of lines.

This commit clarifies that this calculation differs when using the
includes-action-and-meta-data property and that number of documents
should not include action-and-meta-data lines.

@dliappis dliappis added enhancement Improves the status quo :Docs Changes to the documentation labels Feb 3, 2021
@dliappis dliappis added this to the 2.0.4 milestone Feb 3, 2021
@dliappis dliappis self-assigned this Feb 3, 2021
In the typical cases of corpora that doesn't include an action-and-meta
-data line, defining the number of documents is simple and equivalent
to the number of lines.

This commit clarifies that this calculation differs when using the
`includes-action-and-meta-data` property and that number of documents
should not include action-and-meta-data lines.
Copy link
Member

@danielmitterdorfer danielmitterdorfer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks fine. I left a suggestion but no need for another review round.

docs/track.rst Outdated

.. note::

In this case the ``documents`` property should only reflect the number of documents and not additionally include the number of action and metadata lines.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"In this case" refers to the case when includes-action-and-meta-data is set to true. Maybe we should say instead:

When ``includes-action-and-meta-data`` is set to ``true``...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks. It's actually an interesting situation when a corpus contains action and metadata line but this property isn't true: it'll try to index them, frequently failing due to strict mappings :)

I've reworded with slightly fewer words to avoid cognitive load from repetition of terms in 20c839e

@dliappis dliappis merged commit 98bd6f1 into elastic:master Feb 3, 2021
@dliappis dliappis deleted the clarify-num-docs-with-action-and-metadata branch February 3, 2021 11:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Docs Changes to the documentation enhancement Improves the status quo
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants