-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Only build the graph on the batch_worker for non complete tasks. #185
Only build the graph on the batch_worker for non complete tasks. #185
Conversation
Only build the graph on the batch_worker for non complete tasks.
Codecov ReportBase: 59.20% // Head: 59.20% // No change to project coverage 👍
📣 This organization is not using Codecov’s GitHub App Integration. We recommend you install it so Codecov can continue to function properly for your repositories. Learn more Additional details and impacted files@@ Coverage Diff @@
## main #185 +/- ##
=======================================
Coverage 59.20% 59.20%
=======================================
Files 23 23
Lines 1554 1554
=======================================
Hits 920 920
Misses 634 634
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
Sorry, let me open a new PR with a better approach. |
Hmnnn, I can't comment on your infinite loop issue without a minimal reproducible example. Regarding the code change for the While writing this message I saw you closed the PR and announced you will do a new one. Just note I don't really have time for working on b2luigi due to a paper and my thesis deadline in a couple of weeks, so it might take until a weekend or a couple of weeks until I have time to properly review it. Maybe the new option
This is not a b2luigi setting, but a general luigi setting and thus should be set in a export LUIGI_CONFIG_PARSER=toml
export LUIGI_CONFIG_PATH=$HOME/luigi.toml and create a file [worker]
cache_task_completion = true |
Thanks for the quick feedback, and all the best for your submission ! I suspect |
I had an issue where the batch worker seemed to get stuck in infinite loops of task requirements (at least I saw it loop over 1+ million tasks before killing the jobs. This is quite strange as such loops are not present within the pipeline and the actual process call is able to build the graph without issue.
While I haven't fully solved the issue yet, I found setting the batch worker to not check dependents of completed tasks sped up the task processing and at least for now avoided the above issue. I'm still investigating my original issue but this change might be more broadly beneficial.