Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modularize POTRF, make DPLASMA optional. #222

Closed
wants to merge 31 commits into from

Conversation

therault
Copy link
Contributor

Make POTRF composable via make_ttg.

The goal is threefold:

  • remove the dependency on DPLASMA, at least for computation, if not for checking
  • have another example of graphs composition
  • prepare the fields to have a demonstration of speedup via graph composition

@therault therault marked this pull request as draft March 11, 2022 14:11
@therault therault force-pushed the potrf-composition branch from 20dbd01 to 30a3b6b Compare March 16, 2022 16:35
therault added 19 commits April 4, 2022 13:03
… a deadlock (looking for it, probably a task that is partially triggered by mistake)
in the previous fence(), when inside ttg_finalize(), so that
if we're using user-trigger termination detection the parsec_context_wait()
on that context will complete during the call to destroy_worlds().

final_task() uses the parsec_taskpool_started member to remember if it
is necessay to update the number of pending actions again, allowing
the user to terminate the taskpool before, if they want to.
continue to move things in appropriate .h file
start to refactor profiling
…ributed yet. Test feature with dpotrf. Not integrated well at this time: should that be a functionality of world? Of world::impl? Or of something else?
…ommand of GraphViz to group TTs under the same TTG
@therault therault force-pushed the potrf-composition branch from d3325f3 to 1d53756 Compare April 4, 2022 17:03
therault added 2 commits April 5, 2022 11:00
…creation time, so we can capture the TTG hierarchy in the naming of events; use the actual task class id to store the event identifier.
therault added 3 commits April 7, 2022 14:07
…c_task_classes is not contiguous. Add some TTG-specific tracing in parsec traces to estimate amount of time spent in dependency management.
…ers of the world so it compiles with both parsec and madness backends.
@therault therault marked this pull request as ready for review April 7, 2022 21:02
therault added 2 commits April 7, 2022 17:50
…filing is enabled, parsec_task_t stores the hash of the task key in the locals, and the pointer to the key after that. Then, the hash is used to identify the task globally in the DOT file, and the key to provide the human-readable name of the task. If set_arg_local_impl is called from a dummy task, we don't output the dependency; we create the destination task (and destroy it) at the end of set_arg_impl, if the destination task is remote, just to be able to output the dependency.
@therault therault marked this pull request as draft April 25, 2022 15:45
…als from tts *before* they are moved into the ttg's vector of tt; improve passing arguments to testing_dpotri; make testing_dpotri silent for performance measurement; introduce a loop for performance measurement in testing_dpotri
…) we register Ops that can be used in multiple taskpools, and B) we destruct and re-create the taskpool at each fence()
…les, as they are needed for the non-sequential version -- Need to check with Joseph how many of the deep-copy are done, and why. Check validty of POTRF and POTRI calls.
@therault
Copy link
Contributor Author

Superseded by PR #238

@therault therault closed this Sep 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant