cam6_3_107: Reimplement zonal_mean_mod::Invert_Matrix using LAPACK DGESV #788

Merged: 2 commits, Apr 18, 2023

Conversation

@brian-eaton (Collaborator)

The Invert_Matrix subroutine in module zonal_mean_mod has been reimplemented using the LAPACK subroutine DGESV. This change causes roundoff level differences in the diagnostic zonal mean fields compared to the original version of Invert_Matrix with the fix from issue #745 applied.

closes #736
fixes #745
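
For reference, a minimal sketch of the general DGESV-based approach (illustrative names and layout, not the actual CAM code): the inverse is obtained by solving A*X = I.

```fortran
! Illustrative sketch only: invert an n x n matrix by solving A*X = I with
! LAPACK DGESV. Names and details are assumptions, not the CAM implementation.
subroutine invert_matrix_sketch(A, A_inv, n, info)
   implicit none
   integer,          intent(in)  :: n
   double precision, intent(in)  :: A(:,:)       ! assumed-shape input matrix
   double precision, intent(out) :: A_inv(:,:)   ! assumed-shape inverse
   integer,          intent(out) :: info         ! 0 on success (from DGESV)

   double precision :: lu(n,n), rhs(n,n)         ! DGESV overwrites its A and B arguments
   integer          :: ipiv(n), i

   lu  = A(1:n,1:n)
   rhs = 0.0d0
   do i = 1, n
      rhs(i,i) = 1.0d0                           ! right-hand side = identity matrix
   end do

   ! DGESV(N, NRHS, A, LDA, IPIV, B, LDB, INFO); on exit rhs holds X = A**(-1)
   call dgesv(n, n, lu, n, ipiv, rhs, n, info)
   A_inv(1:n,1:n) = rhs
end subroutine invert_matrix_sketch
```

A different factorization and pivoting order naturally changes results at roundoff level, consistent with the differences described above.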

@brian-eaton self-assigned this on Apr 13, 2023
@brian-eaton added the bug, enhancement, and CoupledEval3 labels on Apr 13, 2023
@peverwhee (Collaborator) left a comment

Looks good to me!

@cacraigucar requested a review from @fvitt on Apr 17, 2023
@fvitt left a comment

Pleasantly surprised that the differences in the zonal means are only at roundoff level.

@patcal (Collaborator) left a comment

Adding the dimensions to the passed arrays provides clarity and is a good safeguard against a compiler that may not like the (:,:) usage.
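
For context, the two declaration styles under discussion look roughly like this (illustrative names only, not the CAM code):

```fortran
! Explicit shape: extents are passed in and the dummy uses them directly.
subroutine invert_explicit(n, a, a_inv)
   integer,          intent(in)  :: n
   double precision, intent(in)  :: a(n,n)
   double precision, intent(out) :: a_inv(n,n)
end subroutine invert_explicit

! Assumed shape: the extents travel with the array. This requires an explicit
! interface, which Invert_Matrix already has as a module procedure in
! zonal_mean_mod.
subroutine invert_assumed(a, a_inv)
   double precision, intent(in)  :: a(:,:)
   double precision, intent(out) :: a_inv(:,:)
end subroutine invert_assumed
```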

@cacraigucar changed the title from "Reimplement zonal_mean_mod::Invert_Matrix using LAPACK DGESV" to "cam6_3_107: Reimplement zonal_mean_mod::Invert_Matrix using LAPACK DGESV" on Apr 17, 2023
@brian-eaton (Collaborator, Author)

@patcal, I've been thinking about your comment on using explicit-shape dummy args. After a bit of searching on the pros and cons of assumed-shape vs. explicit-shape dummy args, I think I'm going to stick with assumed shape. It seems to be the more flexible option, and we could potentially move the Invert_Matrix routine into another module to be used elsewhere in the code.

@brian-eaton merged commit fc3cc80 into ESCOMP:cam_development on Apr 18, 2023
@gold2718 (Collaborator)

> Adding the dimensions to the passed arrays provides clarity and is a good safeguard against a compiler that may not like the (:,:) usage.

For the record, adding the dimensions in the declaration statement disables shape checks, so if the array passed into that routine does not match the declared size*, the array references will differ from those in the calling routine, which usually leads to unexpected results.
In my view, this makes Brian's approach safer.

*in at least one dimension that is not the last one.
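
A small self-contained illustration of the hazard described above (hypothetical names; the call is legal via sequence association, so most compilers accept it, possibly with a warning):

```fortran
program shape_mismatch_demo
   implicit none
   double precision :: x(4,3)
   integer :: i, j
   do j = 1, 3
      do i = 1, 4
         x(i,j) = 10*i + j          ! each element encodes its indices, e.g. x(4,1) = 41
      end do
   end do
   call takes_3x3(x)                ! compiles, but the element mapping is remapped
contains
   subroutine takes_3x3(a)
      ! Declared 3x3 although the caller passed a 4x3 array: the declared
      ! leading dimension (3) now sets the column stride, not the caller's (4),
      ! so a(1,2) picks up the caller's x(4,1) = 41 instead of x(1,2) = 12.
      double precision, intent(in) :: a(3,3)
      print *, 'a(1,2) =', a(1,2)
   end subroutine takes_3x3
end program shape_mismatch_demo
```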

@patcal (Collaborator)

patcal commented Apr 18, 2023 via email

@sjsprecious (Collaborator)

> Adding the dimensions to the passed arrays provides clarity and is a good safeguard against a compiler that may not like the (:,:) usage.
>
> For the record, adding the dimensions in the declaration statement disables shape checks, so if the array passed into that routine does not match the declared size*, the array references will differ from those in the calling routine, which usually leads to unexpected results. In my view, this makes Brian's approach safer.
>
> *in at least one dimension that is not the last one.

Thanks @gold2718 for this additional information, which I did not know before. I just wanted to add another point here: based on David Appelhans's talk at GTC23, using assumed-shape variables could easily hurt GPU performance compared to using variables with explicit size.
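
A hedged illustration of the contrast being discussed, in OpenACC style (illustrative routines, not CAM code; any actual penalty depends on compiler and hardware, as discussed below):

```fortran
! Explicit size: the device code sees a plain contiguous array.
subroutine scale_explicit(n, a, s)
   integer,          intent(in)    :: n
   double precision, intent(inout) :: a(n)
   double precision, intent(in)    :: s
   integer :: i
   !$acc parallel loop copy(a)
   do i = 1, n
      a(i) = s * a(i)
   end do
end subroutine scale_explicit

! Assumed shape: the dummy carries an array descriptor (bounds and strides)
! and the data may be non-contiguous; that indirection is the usual source
! of the overhead being discussed.
subroutine scale_assumed(a, s)
   double precision, intent(inout) :: a(:)
   double precision, intent(in)    :: s
   integer :: i, n
   n = size(a)
   !$acc parallel loop copy(a)
   do i = 1, n
      a(i) = s * a(i)
   end do
end subroutine scale_assumed
```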

@gold2718 (Collaborator)

> I just wanted to add another point here: based on David Appelhans's talk at GTC23, using assumed-shape variables could easily hurt GPU performance compared to using variables with explicit size.

This seems like a classic tension (engineering for flexibility and testability vs. engineering for performance).
@sjsprecious, do you have any data on this? It would be great to know how big the penalty is on real, current GPU architectures.

@sjsprecious (Collaborator)

According to the examples in David's talk, the performance penalty could be between 2x and 40x on NVIDIA's A100 GPU when using assumed-shape variables. You can find more details in his GTC23 talk, "Best Practices for Programming GPUs using Fortran, OpenACC, and CUDA". I can also send you a copy of the slides in case it is no longer publicly accessible.

@brian-eaton (Collaborator, Author)

I think in the current context using assumed-shape arrays is fine because Invert_Matrix is just a wrapper for calling DGESV. The dummy arrays in the DGESV interface are assumed-size arrays, which means the compiler is going to create a copy to provide contiguous storage for the array arguments that are passed down.
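
For reference, the DGESV dummy arguments are declared along these lines (a sketch of the standard LAPACK interface, not code from this PR):

```fortran
interface
   subroutine dgesv(n, nrhs, a, lda, ipiv, b, ldb, info)
      integer,          intent(in)    :: n, nrhs, lda, ldb
      double precision, intent(inout) :: a(lda,*)   ! assumed size: overwritten with the LU factors
      integer,          intent(out)   :: ipiv(*)    ! pivot indices
      double precision, intent(inout) :: b(ldb,*)   ! assumed size: overwritten with the solution X
      integer,          intent(out)   :: info
   end subroutine dgesv
end interface
```

Passing an assumed-shape actual argument to these assumed-size dummies is what can trigger the copy-in/copy-out mentioned above; compilers typically skip the copy when the actual argument is known to be contiguous.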

@sjsprecious (Collaborator)

Thanks @brian-eaton. I agree with you that the assumed-shape arrays might not be an issue here, since you are calling a LAPACK interface and not using any GPU. I just wanted to note this potential performance issue somewhere for the record, but this is probably the wrong place for it (apologies if that is the case).

In addition, you raised another good point that I want to make clear: according to David's talk, using assumed-size arrays won't hurt GPU performance, but using assumed-shape arrays will.
