build_on_raijin.sh fails: can't find NetCDF #24
Comments
ping @nichannah - any suggestions? |
for some reason cmake isn't picking up netcdf/4.3.2 despite having But if I revert 020b141 and use netcdf/4.4.1.1 instead, it compiles fine and the exe appears to be properly linked:
Is something broken with the netcdf/4.3.2 library installation on raijin? |
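The linkage check mentioned above can be reproduced by filtering `ldd` output for the NetCDF libraries. This is only a minimal sketch: the helper name `netcdf_libs` and the executable name in the usage comment are placeholders, not anything taken from the build script.

```shell
#!/bin/sh
# Filter `ldd`-style output on stdin down to the NetCDF shared libraries.
# The pattern matches both the C library (libnetcdf) and the Fortran one (libnetcdff).
netcdf_libs() {
    # `|| true` keeps the exit status zero when nothing matches
    grep -i 'libnetcdf' || true
}

# Typical use (the executable name is a placeholder):
#   ldd ./my_model.exe | netcdf_libs
```

The paths printed next to each library show which installed NetCDF version the executable will actually load at run time.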
This appears to be a system problem:
I have emailed help. Perhaps we should move everything to 4.4.1.1? |
Sure, if 4.4.1.1 works just as well as anything else. |
I had an issue with the 4.4.1.1 install at some stage, but perhaps whatever it was has been fixed. Is there any reason to not use a 4.6 release? |
good question. Has anyone tried 4.6.1? (and what's the difference between netcdf/4.6.1 and netcdf/4.6.1p?) There's also 4.6.3 and 4.7.0 but apparently not on NCI yet. |
4.6.1p is using parallel netcdf (pHDF5, MPI-IO, etc), no need to use that one. |
I gather from this that we should avoid using netcdf/4.4.* as it fails on floating point errors: http://cosima.org.au/index.php/2018/06/12/technical-working-group-meeting-june-2018/ |
Link to helpdesk discussion: https://track.nci.org.au/servicedesk/customer/portal/5/HELP-163255 |
You can't rely on e.g. Instead, we have compiler and linker wrappers that add include and library paths as needed based on what modules are loaded. In the case of NetCDF, the only thing you need to do is to load the module and add the needed This works very well with |
Thanks @benmenadue, I think I understand now. I've just made a PR that fixes the issue. |
@aekiss #27 looks like it should work, although it's Raijin-specific -- e.g. on your desktop machine you'll probably want to still use the FindNetCDF. Perhaps add a CMake flag that enables / disables using that logic so that you can specifically disable it on systems that use modules?
|
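The suggestion above, a switch that turns the FindNetCDF logic off on module-based systems, could be driven from the build script roughly as follows. This is a sketch under stated assumptions: the option name `USE_FIND_NETCDF` and the helper `netcdf_cmake_option` are hypothetical and are not defined by this project.

```shell
#!/bin/sh
# Sketch only: -DUSE_FIND_NETCDF is a hypothetical CMake option name,
# not one defined by this project.

# Print the CMake option to pass, given whether the compiler wrappers
# already supply the NetCDF include and library paths ("yes" or "no").
netcdf_cmake_option() {
    case "${1:-no}" in
        yes) echo "-DUSE_FIND_NETCDF=OFF" ;;  # module system: skip FindNetCDF
        *)   echo "-DUSE_FIND_NETCDF=ON"  ;;  # e.g. a desktop: let FindNetCDF search
    esac
}

# Example: on a module-based machine the build script might run
#   cmake $(netcdf_cmake_option yes) ..
```

Defaulting to `ON` keeps the desktop behaviour unchanged, so only the module-based build script needs to opt out.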
The compiler messages indicate that oasis is being compiled with
|
Thanks @benmenadue, that looks like a good idea |
@aekiss That will give you undefined behaviour -- the |
Hmm, that doesn't sound at all good! |
@aekiss I'm not sure why you would be getting a reference to |
thanks @benmenadue but I still get |
and my |
@aekiss That's odd... I just tried it and it looks fine to me, for example
(the
(and I had no modules loaded before this). Do you have any module load commands in your shell initialisation files? |
my
|
It may be unrelated, but that
and made changes to the global compiler and linker flags:
This is one of the reasons we strongly recommend not using conda-like packaging systems. Try with all of those |
Ah -- you have this in your
which will pick up the default version of 4.2.1.1. Perhaps something in oasis3-mct is stepping through (t)csh and picking it up here? |
Odd. I've commented out every module load that looks relevant in .login and .profile but still have |
It should, unless it's being picked up again by another script running. Do you mind if I try it as you to see if I can find where it's coming from? |
be my guest - thanks! |
do you need me to do anything or are you already able to log in as me? |
Sorry, didn't get a chance to look at this this afternoon, but I can |
ok thanks :-) |
from https://opus.nci.org.au/display/Help/Gadi%3A+NCI%27s+New+Supercomputer |
I still have this issue, despite taking out every potentially relevant module load from my .login, .profile, .rashrc and .bashrc. When I do
the compiler messages include This differs from what I get with
@benmenadue if you have a spare moment to try this logged in as me that would be great. @aidanheerdegen - could you try doing the above to see whether the problem is isolated to me? |
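One way to double-check the startup files for leftover module commands is to scan them in one pass. The sketch below is illustrative only: the helper name `find_module_loads` is made up, and the file list in the usage comment is an assumption about which startup files are relevant.

```shell
#!/bin/sh
# List every uncommented "module load"-style line in the given startup files.
# Output format is file:line-number:text. Lines whose first non-blank
# character is '#' are skipped; trailing comments are not detected.
find_module_loads() {
    for f in "$@"; do
        # Skip files that do not exist or are not readable
        [ -r "$f" ] || continue
        # The extra /dev/null argument makes grep print the file name;
        # `|| true` keeps the exit status zero when a file has no matches
        grep -nE '^[[:space:]]*([^#[:space:]].*)?module[[:space:]]+(load|add|swap|switch|use)([[:space:]]|$)' \
            "$f" /dev/null || true
    done
}

# Typical use (adjust the list to the shells actually in play):
#   find_module_loads ~/.login ~/.profile ~/.bashrc ~/.cshrc
```

This only covers files passed in explicitly, so a module loaded by a script that one of those files sources would still need to be traced by hand.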
@aekiss
This might be the source of that. But if so, I'm not sure why it wasn't happening for me as well. |
thanks @benmenadue, well spotted! When I do |
Yep, I can confirm that this fixes it. Thanks again @benmenadue |
reopening - also need to deal with PR #27 which should probably be made more portable as per Ben's suggestion #24 (comment) |
I have merged #32 which includes Ben's suggestion. |
Here's what happens when I try to compile: