-
Notifications
You must be signed in to change notification settings - Fork 308
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DAOS-16922 dfuse: Avoid assertion on shutdown #15972
Conversation
bc385da
to
7a89fb6
Compare
Test stage Build on EL 8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/1/execution/node/344/log |
Test stage Build RPM on EL 9 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/1/execution/node/321/log |
Test stage Build on Leap 15.5 with Intel-C and TARGET_PREFIX completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/1/execution/node/375/log |
Test stage Build RPM on EL 8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/1/execution/node/370/log |
Test stage Build RPM on Leap 15.5 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/1/execution/node/334/log |
Test stage Build DEB on Ubuntu 20.04 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/1/execution/node/261/log |
Ticket title is ' Assertion 'atomic_load_relaxed(&ie->ie_open_count) == 0' ' |
When there are open file handles and dfuse is shutdown using umount -f, it would assert due to resources being used. Rather than asserting, just print a warning that we are shutting down ungracefully. Features: dfuse Signed-off-by: Jeff Olivier <[email protected]>
7a89fb6
to
32401f9
Compare
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15972/2/execution/node/1541/log |
Master branch has some failures that this PR hit but they are clearly unrelated to dfuse. |
Actually, one of the tests timed out while checking the dfuse mount point
It could be a network/system issue but I'm not sure |
Is there a dfuse log? I'm very skeptical this patch could be the root cause |
End has:
|
Log link is useless for me. Can you attach full output in slack or the jira ticket? This is almost certainly a different issue as those errors are happening when dfuse is still running. Looks like it's lost connectivity with the server for whatever reason. My patch simply avoids an assertion during shutdown |
Signed-off-by: Jeff Olivier <[email protected]>
I don't doubt you. I'll send the logs in slack |
When there are open file handles and dfuse is shutdown using umount -f, it would assert due to resources being used. Rather than asserting, just print a warning that we are shutting down ungracefully.
Before requesting gatekeeper:
Features:
(orTest-tag*
) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.Gatekeeper: