Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Internalize triple_chevron #3648

Merged
merged 1 commit into from
Feb 3, 2025

Conversation

bernhardmgruber
Copy link
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested review from a team as code owners February 3, 2025 10:16
@bernhardmgruber bernhardmgruber changed the title Internalize triple_chevron Internalize triple_chevron Feb 3, 2025
Copy link
Contributor

github-actions bot commented Feb 3, 2025

🟩 CI finished in 1h 37m: Pass: 100%/90 | Total: 2d 16h | Avg: 42m 52s | Max: 1h 17m | Hits: 221%/12730
  • 🟩 cub: Pass: 100%/44 | Total: 1d 15h | Avg: 54m 27s | Max: 1h 17m | Hits: 244%/3500

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 13h | Avg: 54m 16s | Max:  1h 17m | Hits: 244%/3500  
      🟩 arm64              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 26s | Max: 59m 03s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 59m | Avg: 59m 57s | Max:  1h 08m | Hits: 244%/875   
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
      🟩 12.8               Pass: 100%/37  | Total:  1d 08h | Avg: 52m 52s | Max:  1h 17m | Hits: 243%/2625  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 59m | Avg: 59m 57s | Max:  1h 08m | Hits: 244%/875   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
      🟩 nvcc12.8           Pass: 100%/35  | Total:  1d 06h | Avg: 52m 25s | Max:  1h 17m | Hits: 243%/2625  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 13h | Avg: 54m 09s | Max:  1h 17m | Hits: 244%/3500  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 49m | Avg: 57m 24s | Max:  1h 02m
      🟩 Clang15            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m
      🟩 Clang16            Pass: 100%/2   | Total:  1h 56m | Avg: 58m 25s | Max:  1h 03m
      🟩 Clang17            Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 37m | Avg: 48m 15s | Max:  1h 01m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 50s | Max:  1h 00m
      🟩 GCC8               Pass: 100%/1   | Total: 56m 34s | Avg: 56m 34s | Max: 56m 34s
      🟩 GCC9               Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m
      🟩 GCC10              Pass: 100%/2   | Total:  1h 51m | Avg: 55m 42s | Max: 56m 50s
      🟩 GCC11              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC12              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m
      🟩 GCC13              Pass: 100%/10  | Total:  6h 22m | Avg: 38m 15s | Max:  1h 10m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m | Hits: 244%/1750  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 30m | Avg:  1h 15m | Max:  1h 17m | Hits: 243%/1750  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 30m | Avg: 54m 42s | Max:  1h 03m
      🟩 GCC                Pass: 100%/21  | Total: 17h 15m | Avg: 49m 19s | Max:  1h 10m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 50m | Avg:  1h 12m | Max:  1h 17m | Hits: 244%/3500  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 51m 32s | Avg: 25m 46s | Max: 27m 37s
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 11h | Avg:  1h 01m | Max:  1h 17m | Hits: 244%/3500  
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 03m | Avg: 30m 26s | Max:  1h 00m
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 13h | Avg:  1h 00m | Max:  1h 17m | Hits: 244%/3500  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 59s | Avg: 21m 59s | Max: 21m 59s
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 37s | Avg: 17m 37s | Max: 17m 37s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 07s | Max: 23m 55s
      🟩 TestGPU            Pass: 100%/2   | Total: 38m 21s | Avg: 19m 10s | Max: 19m 53s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 51m 32s | Avg: 25m 46s | Max: 27m 37s
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 10m | Avg:  1h 10m | Max:  1h 10m
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 37m | Avg:  1h 01m | Max:  1h 17m | Hits: 244%/2625  
      🟩 20                 Pass: 100%/24  | Total: 19h 18m | Avg: 48m 16s | Max:  1h 12m | Hits: 241%/875   
    
  • 🟩 thrust: Pass: 100%/43 | Total: 23h 49m | Avg: 33m 14s | Max: 1h 00m | Hits: 213%/9230

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 40m 05s | Avg: 20m 02s | Max: 28m 56s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 22h 46m | Avg: 33m 20s | Max:  1h 00m | Hits: 213%/9230  
      🟩 arm64              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 21s | Max: 31m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 06m | Avg: 37m 17s | Max: 55m 16s | Hits: 174%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 53m | Avg: 56m 45s | Max: 58m 20s
      🟩 12.8               Pass: 100%/36  | Total: 18h 49m | Avg: 31m 22s | Max:  1h 00m | Hits: 222%/7384  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 57m 03s | Avg: 28m 31s | Max: 30m 20s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 06m | Avg: 37m 17s | Max: 55m 16s | Hits: 174%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 53m | Avg: 56m 45s | Max: 58m 20s
      🟩 nvcc12.8           Pass: 100%/34  | Total: 17h 52m | Avg: 31m 32s | Max:  1h 00m | Hits: 222%/7384  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 03s | Avg: 28m 31s | Max: 30m 20s
      🟩 nvcc               Pass: 100%/41  | Total: 22h 52m | Avg: 33m 28s | Max:  1h 00m | Hits: 213%/9230  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 04s | Max: 33m 03s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 08m | Avg: 34m 01s | Max: 34m 04s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 29s | Max: 32m 44s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 57s | Max: 33m 20s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 47m | Avg: 23m 57s | Max: 32m 25s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 49s | Max: 32m 37s
      🟩 GCC8               Pass: 100%/1   | Total: 32m 03s | Avg: 32m 03s | Max: 32m 03s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 34m 12s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 28s | Max: 35m 29s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 59s | Max: 36m 38s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 04s | Max: 36m 39s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 16m | Avg: 24m 30s | Max: 38m 00s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 44s | Max: 55m 16s | Hits: 174%/3692  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 31m | Avg: 50m 31s | Max:  1h 00m | Hits: 238%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 45s | Max: 58m 20s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 12m | Avg: 28m 59s | Max: 34m 04s
      🟩 GCC                Pass: 100%/19  | Total:  9h 22m | Avg: 29m 35s | Max: 38m 00s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 21m | Avg: 52m 12s | Max:  1h 00m | Hits: 213%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 45s | Max: 58m 20s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total: 19h 52m | Avg: 36m 08s | Max: 59m 22s | Hits: 174%/5538  
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 56m | Avg: 23m 40s | Max:  1h 00m | Hits: 270%/3692  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 29m | Avg: 36m 28s | Max:  1h 00m | Hits: 174%/7384  
      🟩 TestCPU            Pass: 100%/3   | Total: 46m 41s | Avg: 15m 33s | Max: 31m 31s | Hits: 365%/1846  
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 04s | Avg: 11m 01s | Max: 11m 25s
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total: 34m 28s | Avg: 34m 28s | Max: 34m 28s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 35m | Avg: 37m 46s | Max: 59m 22s | Hits: 174%/5538  
      🟩 20                 Pass: 100%/21  | Total: 10h 34m | Avg: 30m 11s | Max:  1h 00m | Hits: 270%/3692  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 7m 06s | Avg: 3m 33s | Max: 4m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  4m 47s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 19s | Avg:  2m 19s | Max:  2m 19s
      🟩 Test               Pass: 100%/1   | Total:  4m 47s | Avg:  4m 47s | Max:  4m 47s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 04s | Avg: 26m 04s | Max: 26m 04s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 90)

# Runner
65 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@bernhardmgruber bernhardmgruber merged commit e586c3b into NVIDIA:main Feb 3, 2025
105 of 108 checks passed
@bernhardmgruber bernhardmgruber deleted the intern_triple_chevron branch February 3, 2025 12:49
Copy link
Contributor

github-actions bot commented Feb 3, 2025

Backport failed for branch/2.8.x, because it was unable to cherry-pick the commit(s).

Please cherry-pick the changes locally.

git fetch origin branch/2.8.x
git worktree add -d .worktree/backport-3648-to-branch/2.8.x origin/branch/2.8.x
cd .worktree/backport-3648-to-branch/2.8.x
git checkout -b backport-3648-to-branch/2.8.x
ancref=$(git merge-base 711ee01ffdd8af27a9ca160c2b4abff523e3e96b 136d56a79961c9575e5ce3cd9fd2017e277d39cb)
git cherry-pick -x $ancref..136d56a79961c9575e5ce3cd9fd2017e277d39cb

bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Feb 3, 2025
bernhardmgruber added a commit that referenced this pull request Feb 3, 2025
Also make cuda::std::identity available in C++11
shwina pushed a commit to shwina/cccl that referenced this pull request Feb 3, 2025
@bernhardmgruber bernhardmgruber mentioned this pull request Feb 5, 2025
36 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants