Modularize POTRF, make DPLASMA optional. #222

therault · 2022-03-11T14:11:28Z

Make POTRF composable via make_ttg.

The goal is threefold:

- remove the dependency on DPLASMA, at least for computation, if not for checking
- have another example of graph composition
- prepare the ground for a demonstration of speedup via graph composition

in the previous fence(), when inside ttg_finalize(), so that if we're using user-trigger termination detection the parsec_context_wait() on that context will complete during the call to destroy_worlds(). final_task() uses the parsec_taskpool_started member to remember if it is necessary to update the number of pending actions again, allowing the user to terminate the taskpool earlier, if they want to.

…filing is enabled, parsec_task_t stores the hash of the task key in the locals, and the pointer to the key after that. Then, the hash is used to identify the task globally in the DOT file, and the key to provide the human-readable name of the task. If set_arg_local_impl is called from a dummy task, we don't output the dependency; we create the destination task (and destroy it) at the end of set_arg_impl, if the destination task is remote, just to be able to output the dependency.
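The hash-for-identity, key-for-label scheme described above can be sketched as follows. This is a minimal illustration and not the code from this PR: the `Key` type, the `key_hash` function, and the `dot_node`/`dot_edge` helpers are all hypothetical names, and the hash is a placeholder.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>

// Hypothetical 2D task key (for example a tile coordinate); not a type from the PR.
struct Key {
  int i;
  int j;
};

// Placeholder hash: any stable function of the key would serve the same purpose.
std::size_t key_hash(const Key& k) {
  return static_cast<std::size_t>(k.i) * 1000003u + static_cast<std::size_t>(k.j);
}

// The hash gives the DOT node its identifier; the key gives it a readable label.
std::string dot_node(const std::string& tt_name, const Key& k) {
  std::ostringstream os;
  os << "t" << key_hash(k) << " [label=\"" << tt_name << "(" << k.i << "," << k.j << ")\"];";
  return os.str();
}

// A dependency only needs the two hashes, so it can be written without the labels.
std::string dot_edge(const Key& src, const Key& dst) {
  std::ostringstream os;
  os << "t" << key_hash(src) << " -> t" << key_hash(dst) << ";";
  return os.str();
}
```

Because an edge is written from hashes alone, the sender only needs enough information about the destination task to compute its hash, which may be why the commit creates a temporary destination task for remote targets.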

…als from tts *before* they are moved into the ttg's vector of tt; improve passing arguments to testing_dpotri; make testing_dpotri silent for performance measurement; introduce a loop for performance measurement in testing_dpotri

therault · 2022-09-12T13:48:29Z

Superseded by PR #238

therault marked this pull request as draft March 11, 2022 14:11

therault force-pushed the potrf-composition branch from 20dbd01 to 30a3b6b Compare March 16, 2022 16:35

therault added 19 commits April 4, 2022 13:03

Modularize POTRF, make DPLASMA optional.

7e89272

Give names to the edges of the dispatcher TT

cc30936

Forgot to include tt_dispatch in the potrf_ttg

3dcc54d

DAG issues solved, seems to work until ttg::finalize() where we reach a deadlock (looking for it, probably a task that is partially triggered by mistake)

6697bb4

Implementing PLGSY... But now make_tt fails for potrf_dispatch, probably because of porting to new master

4764eac

non-const inputs must be passed as &&...

2bdee33

Split POTRF into multiple files; rename main test program

28783b0

Add TRTRI; Seems to work, but I need to write a checker.

b95d924

Continue to move things into the appropriate .h files; start to refactor profiling

Use dynamic termination detection for distributed runs

a892913

Add LAUUM (need a check for that too)

81917cb

Enable PaRSEC profiling system

94b7152

Add support for DAG of tasks creation in PaRSEC. Doesn't work in distributed yet. Test feature with dpotrf. Not integrated well at this time: should that be a functionality of world? Of world::impl? Or of something else?

d88949f

Hierarchical version of DOT functionality: use the subgraph cluster command of GraphViz to group TTs under the same TTG

f7e5ffa
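As an illustration of the GraphViz feature this commit refers to, here is a minimal sketch that emits one `subgraph cluster_N` block per graph. It is not the code from this PR: the `GraphGroup` struct and `to_dot` function are hypothetical, and the graph and node names passed to it are placeholders.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical description of one composed graph: its name and the names of its TTs.
struct GraphGroup {
  std::string name;
  std::vector<std::string> tts;
};

// Emit a DOT digraph in which every group becomes a GraphViz cluster.
// GraphViz draws a labelled box around a subgraph whose name starts with "cluster".
std::string to_dot(const std::vector<GraphGroup>& groups) {
  std::ostringstream os;
  os << "digraph G {\n";
  for (std::size_t g = 0; g < groups.size(); ++g) {
    os << "  subgraph cluster_" << g << " {\n";
    os << "    label=\"" << groups[g].name << "\";\n";
    for (const auto& tt : groups[g].tts) {
      os << "    \"" << tt << "\";\n";
    }
    os << "  }\n";
  }
  os << "}\n";
  return os.str();
}
```

Edges between nodes can be written outside the cluster blocks; GraphViz still places each node inside the box of the cluster that declared it.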

Working version of potri

bef86df

Cleanup pass

7e649ae

Another pass of cleanup

69eeb75

Last touches of cleanup?

bb85c67

Command line arguments for dpotri, more cleanups. Enable profiling if the user has enabled it via MCA.

1d53756

therault force-pushed the potrf-composition branch from d3325f3 to 1d53756 Compare April 4, 2022 17:03

therault added 2 commits April 5, 2022 11:00

Slightly more efficient dispatch by std::move() vectors of keys in ttg::broadcast()

ec7b22c
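The saving from moving a vector of keys instead of copying it can be sketched with plain standard-library code. The `Dispatcher` type and its `broadcast` member below are hypothetical stand-ins and do not reproduce the signature of `ttg::broadcast()`.

```cpp
#include <cassert>
#include <utility>
#include <vector>

// Hypothetical sink that takes ownership of a list of keys.
struct Dispatcher {
  std::vector<std::vector<int>> pending;

  // Accepting an rvalue reference lets the caller hand over the buffer
  // instead of paying for an element-by-element copy.
  void broadcast(std::vector<int>&& keys) { pending.push_back(std::move(keys)); }
};
```

When the caller writes `d.broadcast(std::move(keys))`, the stored vector reuses the heap buffer that `keys` owned, so the cost does not depend on the number of keys. The caller should not rely on the contents of `keys` afterwards.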

Register the TT for profiling during make_executable() instead of at creation time, so we can capture the TTG hierarchy in the naming of events; use the actual task class id to store the event identifier.

27d2921

devreal mentioned this pull request Apr 6, 2022

Implement simple key ranges for broadcast #230

Open

therault added 3 commits April 7, 2022 14:07

Work around the fact that TTGs are TTBase, so the identifiers of parsec_task_classes are not contiguous. Add some TTG-specific tracing in parsec traces to estimate the amount of time spent in dependency management.

8ff50b5

Use sourceforge repository for doxygen

3979b40

Rename to dag_on/dag_off, and make profile_on/off and dag_on/off members of the world so it compiles with both parsec and madness backends.

1168298

therault marked this pull request as ready for review April 7, 2022 21:02

Merge DOT with disabled type in TTG-hierarchical DOT

c2241cc

therault added 2 commits April 7, 2022 17:50

Track last version of parsec to support profiling

a81e161

therault marked this pull request as draft April 25, 2022 15:45

therault mentioned this pull request Apr 25, 2022

Enable PaRSEC profiling system #227

Closed

evaleev mentioned this pull request Apr 25, 2022

Start working on the front page of user documentation #208

Merged

9 tasks

therault added 4 commits April 27, 2022 15:18

the profiling_array should be persistent between taskpools, because A) we register Ops that can be used in multiple taskpools, and B) we destruct and re-create the taskpool at each fence()

8e50b58

Multiple fixes in POTRF; re-enable deep copy constructor for matrixtiles, as they are needed for the non-sequential version -- Need to check with Joseph how many of the deep-copy are done, and why. Check validity of POTRF and POTRI calls.

9669232

Update the potri tester to simplify performance measurements

00bb23d

therault mentioned this pull request Sep 12, 2022

TTG composition, example of POINV, and a few fixes to parsec profiling #238

Merged

therault closed this Sep 12, 2022
