Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document and/or gracefully fail creating invalid ILM policies in mixed-cluster #37085

Closed
talevy opened this issue Jan 2, 2019 · 2 comments
Closed
Labels
:Data Management/ILM+SLM Index and Snapshot lifecycle management >docs General docs changes

Comments

@talevy
Copy link
Contributor

talevy commented Jan 2, 2019

Problem Context

While operating a mixed-version cluster where the master node version contains new lifecycle actions
that previously didn't exist, there can be non-friendly failures.

Creating policies with newly introduced actions will result in success on the master, but then
can either fail when hitting ILM apis from older coordinating nodes, or running unsupported actions
on other nodes.

Two scenarios:

  1. Requesting policy from older node:
{"error":{"root_cause":[{"type":"transport_serialization_exception","reason":"Failed to deserialize response from handler [org.elasticsearch.transport.TransportService$Con
textRestoreResponseHandler]"}],"type":"transport_serialization_exception","reason":"Failed to deserialize response from handler [org.elasticsearch.transport.TransportServi
ce$ContextRestoreResponseHandler]","caused_by":{"type":"illegal_argument_exception","reason":"Unknown NamedWriteable [org.elasticsearch.xpack.core.indexlifecycle.Lifecycle
Action][freeze]"}},"status":500}
  1. Executing new lifecycle action that hits APIs that do not exist in older nodes

Solution

There needs to be more guidance on how to best manage new policies while operating mixed clusters. It should be recommended to disable ILM while a rolling-upgrade is in progress. This would avoid these unintuitive errors.

In addition to better guidance in documentation, it may make sense to disallow creation of these invalid policies while in mixed-clusters and explain the situation to the user.

@talevy talevy added >enhancement :Data Management/ILM+SLM Index and Snapshot lifecycle management labels Jan 2, 2019
@elasticmachine
Copy link
Collaborator

Pinging @elastic/es-core-features

@talevy
Copy link
Contributor Author

talevy commented Jan 15, 2019

After some discussion with the team, we decided that it will be sufficient to just document this scenario for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Data Management/ILM+SLM Index and Snapshot lifecycle management >docs General docs changes
Projects
None yet
Development

No branches or pull requests

2 participants