Patch release for bug fixes.
Commits in this release:
view commit • Updating documentation
view commit • Merge branch 'develop' of github.com:khuck/xpress-apex into develop
view commit • Checking for nvcc 10 and gcc 8 and setting flags accordingly
view commit • Adding periodic plugin example, enabling static global constructors and destructors
view commit • Adding pthread wrapper and screen_output to policy plugin example
view commit • Update README.md
view commit • Re-enablling ability to get vector of available profiles, updated periodic example
view commit • Don't pin threads by default, it's kind of broken on summit
view commit • Fixing HPX build due to static global constructor
view commit • Fixing bug #134. Changing from pthread_setaffinity_np() to sched_get/setaffinity()
view commit • Fixing issue #135 When tracking CPU/GPU activity, the memory allocation counters should be associated with the thread making the call, when writing to OTF2 traces. This change allows for an optional argument to the apex::sample_value call that indicates whether the counter is assocaited with the specific thread or the process as a whole (the default).
view commit • Fixing #137. Now explicitly tracking all memory allocations and frees on both the host and the device.
view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
view commit • Re-enable pinning by default
view commit • Fixing #136. Now have the ability to capture task tree, not just graph. No more cycles!
view commit • Adding dependency_tree class
view commit • Fixing build errors for -std=c++11 compliance
view commit • Initial memory wrapper, bugs everywhere
view commit • Adding additional MPI rank detection support
view commit • Fixing build issue with HPX due to modified sample_value function
view commit • Fixing cuda 10.1 build errors.
view commit • Fixing gperftool config by finding correct include location
view commit • Fixing gperftool config by finding correct include location
view commit • Removing some high-overhead and useless counters
view commit • Working memory wrapper for malloc/free, removing pointers from name demangling due to instability
view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
view commit • Adding support for calloc and realloc
view commit • Fixing comment
view commit • Adding memory wrapper code for HPX configurations
view commit • Updating copyright to 2021
view commit • Fixing measurement output when dump is called multiple times.
view commit • Fixing tasktree processing for non-timers, adding to apex_exec script
view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
view commit • Merge branch 'develop' of git.nic.uoregon.edu:/gitroot/xpress-apex into develop
view commit • Fixing elapsed time in graphs and shortening timer names by not including full file name and path by default
view commit • Fixing concurrency handler static global variable
view commit • Fix HPX barriers in OTF2 output
view commit • Merge pull request #143 from severinstrobl/otf2_hpx_barriers
view commit • Enabling LLVM 11 to build cuda examples
view commit • Forgot to set profiler to "stopped" when adding async activity.
view commit • Removing APEX counters (llvm won't link them?)
view commit • Cleaning up timers. We had been using a custom clock in order to use rdtsc on Intel platforms, but that's kind of pointless. It becomes a nightmare when trying to convert for OTF2 traces, and CUDA (and other GPUs) only provide timestamps in nanoseconds. Therefore, all timing is assumed to be done in nanoseconds now.
view commit • Flush CUPTI before dumping.
view commit • Need to move forward declaration.
view commit • Only override the rank if suspect it's wrong
view commit • Updating version number
view commit • Updating version number.
view commit • Merge branch 'develop'