Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft for INALT60 recipe #26

Open
wants to merge 17 commits into
base: master
Choose a base branch
from
Open

Conversation

roxyboy
Copy link

@roxyboy roxyboy commented Apr 14, 2021

A recipe PR for lNATL60 maintained by GEOMAR.

@roxyboy roxyboy changed the title lNALT60 recipe Draft for lNALT60 recipe Apr 14, 2021
nitems_per_input=None,
target_chunks={'time_counter': 15}
)
recipe:surf_ocean_4h = NetCDFtoZarrSequentialRecipe(
Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rabernat I'm not sure if the syntax is correct here... Could you advise?

@roxyboy roxyboy changed the title Draft for lNALT60 recipe Draft for INALT60 recipe Apr 14, 2021
@rabernat rabernat added the swot-adac SWOT Adopt-a-Crossover Dataset label May 10, 2021
@cisaacstern
Copy link
Member

@roxyboy, the surf_ocean data has two frequencies: 4h and 5d.

My understanding is that each of these frequencies should be its own dataset.

Did I organize these correctly?

Expand for dataset structure 👇
for r in list(recipes):
  print(r)
  d = dict(recipes[r].file_pattern.items())
  for k, v in zip(d.keys(),d.values()):
      print(k, v[87:])
  print()
surf_ocean_4h
(0, 0) /INALT60_4h_surface_u_02.nc
(0, 1) /INALT60_4h_surface_u_03.nc
(0, 2) /INALT60_4h_surface_u_04.nc
(0, 3) /INALT60_4h_surface_u_08.nc
(0, 4) /INALT60_4h_surface_u_09.nc
(0, 5) /INALT60_4h_surface_u_10.nc
(1, 0) /INALT60_4h_surface_v_02.nc
(1, 1) /INALT60_4h_surface_v_03.nc
(1, 2) /INALT60_4h_surface_v_04.nc
(1, 3) /INALT60_4h_surface_v_08.nc
(1, 4) /INALT60_4h_surface_v_09.nc
(1, 5) /INALT60_4h_surface_v_10.nc
(2, 0) /INALT60_4h_surface_hts_02.nc
(2, 1) /INALT60_4h_surface_hts_03.nc
(2, 2) /INALT60_4h_surface_hts_04.nc
(2, 3) /INALT60_4h_surface_hts_08.nc
(2, 4) /INALT60_4h_surface_hts_09.nc
(2, 5) /INALT60_4h_surface_hts_10.nc

surf_ocean_5d
(0, 0) /INALT60_5d_surface_u_02.nc
(0, 1) /INALT60_5d_surface_u_03.nc
(0, 2) /INALT60_5d_surface_u_04.nc
(0, 3) /INALT60_5d_surface_u_08.nc
(0, 4) /INALT60_5d_surface_u_09.nc
(0, 5) /INALT60_5d_surface_u_10.nc
(1, 0) /INALT60_5d_surface_v_02.nc
(1, 1) /INALT60_5d_surface_v_03.nc
(1, 2) /INALT60_5d_surface_v_04.nc
(1, 3) /INALT60_5d_surface_v_08.nc
(1, 4) /INALT60_5d_surface_v_09.nc
(1, 5) /INALT60_5d_surface_v_10.nc
(2, 0) /INALT60_5d_surface_hts_02.nc
(2, 1) /INALT60_5d_surface_hts_03.nc
(2, 2) /INALT60_5d_surface_hts_04.nc
(2, 3) /INALT60_5d_surface_hts_08.nc
(2, 4) /INALT60_5d_surface_hts_09.nc
(2, 5) /INALT60_5d_surface_hts_10.nc

surf_flux_1d
(0, 0) /INALT60_1d_surface_flux_02.nc
(0, 1) /INALT60_1d_surface_flux_03.nc
(0, 2) /INALT60_1d_surface_flux_04.nc
(0, 3) /INALT60_1d_surface_flux_08.nc
(0, 4) /INALT60_1d_surface_flux_09.nc
(0, 5) /INALT60_1d_surface_flux_10.nc
(1, 0) /INALT60_1d_surface_taux_02.nc
(1, 1) /INALT60_1d_surface_taux_03.nc
(1, 2) /INALT60_1d_surface_taux_04.nc
(1, 3) /INALT60_1d_surface_taux_08.nc
(1, 4) /INALT60_1d_surface_taux_09.nc
(1, 5) /INALT60_1d_surface_taux_10.nc
(2, 0) /INALT60_1d_surface_tauy_02.nc
(2, 1) /INALT60_1d_surface_tauy_03.nc
(2, 2) /INALT60_1d_surface_tauy_04.nc
(2, 3) /INALT60_1d_surface_tauy_08.nc
(2, 4) /INALT60_1d_surface_tauy_09.nc
(2, 5) /INALT60_1d_surface_tauy_10.nc

int_ocean_1d
(0, 0) /INALT60_1d_upper1000m_ts_02.nc
(0, 1) /INALT60_1d_upper1000m_ts_03.nc
(0, 2) /INALT60_1d_upper1000m_ts_04.nc
(0, 3) /INALT60_1d_upper1000m_ts_08.nc
(0, 4) /INALT60_1d_upper1000m_ts_09.nc
(0, 5) /INALT60_1d_upper1000m_ts_10.nc
(1, 0) /INALT60_1d_upper1000m_u_02.nc
(1, 1) /INALT60_1d_upper1000m_u_03.nc
(1, 2) /INALT60_1d_upper1000m_u_04.nc
(1, 3) /INALT60_1d_upper1000m_u_08.nc
(1, 4) /INALT60_1d_upper1000m_u_09.nc
(1, 5) /INALT60_1d_upper1000m_u_10.nc
(2, 0) /INALT60_1d_upper1000m_v_02.nc
(2, 1) /INALT60_1d_upper1000m_v_03.nc
(2, 2) /INALT60_1d_upper1000m_v_04.nc
(2, 3) /INALT60_1d_upper1000m_v_08.nc
(2, 4) /INALT60_1d_upper1000m_v_09.nc
(2, 5) /INALT60_1d_upper1000m_v_10.nc
(3, 0) /INALT60_1d_upper1000m_w_02.nc
(3, 1) /INALT60_1d_upper1000m_w_03.nc
(3, 2) /INALT60_1d_upper1000m_w_04.nc
(3, 3) /INALT60_1d_upper1000m_w_08.nc
(3, 4) /INALT60_1d_upper1000m_w_09.nc
(3, 5) /INALT60_1d_upper1000m_w_10.nc

grid
(0,) /INALT60_mesh_mask.nc

@roxyboy
Copy link
Author

roxyboy commented May 22, 2021

@roxyboy, the surf_ocean data has two frequencies: 4h and 5d.

My understanding is that each of these frequencies should be its own dataset.

Did I organize these correctly?

Yes, it looks great! :)

@cisaacstern
Copy link
Member

@roxyboy, the surface datasets for INALT60 are now on OSN. The following will return a dictionary containing all three:

import s3fs
import xarray as xr

endpoint_url = 'https://ncsa.osn.xsede.org'
fs_osn = s3fs.S3FileSystem(anon=True, client_kwargs={'endpoint_url': endpoint_url},)

url = "s3://Pangeo/pangeo-forge/swot_adac/INALT60/"
inalt60_datasets = {
    ds: xr.open_zarr(fs_osn.get_mapper(f"{url}{ds}.zarr"), consolidated=True)
    for ds in ["surf_ocean_4h", "surf_ocean_5d", "surf_flux_1d"]
}
inalt60_datasets

As we discussed above, these were the only swot-adac recipes which required variable merging, so I'll be interested to know if the zarr stores were built correctly.

@roxyboy
Copy link
Author

roxyboy commented May 26, 2021

@cisaacstern Thanks again for working on this but it seems that surf_ocean_4h and surf_ocean_5d only includes the zonal velocity (u) as a variable while it should also have v and hts...

@rabernat
Copy link
Contributor

In addition to the missing variables: these also seem to have the same problem as noted in #29 (comment): no actual data! For example

fs_osn.ls("s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr/vozocrtx")
['Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr/vozocrtx/.zarray',
 'Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr/vozocrtx/.zattrs']

Another way to see this is via the Zarr "Chunks initialized" statistics

import zarr
group = zarr.open_consolidated(fs_osn.get_mapper("s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr"))
group['vozocrtx'].info
Name               : /vozocrtx
Type               : zarr.core.Array
Data type          : float32
Shape              : (1020, 738, 509)
Chunk shape        : (15, 738, 509)
Order              : C
Read-only          : False
Compressor         : Blosc(cname='lz4', clevel=5, shuffle=SHUFFLE, blocksize=0)
Store type         : zarr.storage.ConsolidatedMetadataStore
Chunk store type   : fsspec.mapping.FSMap
No. bytes          : 1532619360 (1.4G)
Chunks initialized : 0/68

We can see that none of the chunks have been initialized.

@cisaacstern let's sync up at some point today to dig into what might be going wrong.

@cisaacstern
Copy link
Member

Noting that based on my conversation with Ryan, this issue appears to be a variant of the one described in #29 (comment).

IIUC, both issues arise when an attempt is made to open an empty zarr store. In this recipe, that happens on this call to ds.to_zarr. The mode="a" kwarg passed to this function designates it as an "append" operation, and the error seems to arise when xarray's zarr backend attempts to open the store to which the data should be appended. Full details (fyi: quite long!) here:

Logs + Traceback
RUNNING surf_ocean_4h
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(0, 0)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_02.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_02.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(0, 0)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 0): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_02.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_02.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(0, 1)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_03.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_03.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(0, 1)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 1): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_03.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_03.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-03-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(0, 2)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_04.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_04.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(0, 2)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 2): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_04.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_04.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1080, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-04-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(0, 3)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_08.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_08.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(0, 3)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 3): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_08.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_08.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-08-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(0, 4)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_09.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_09.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(0, 4)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 4): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_09.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_09.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1080, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-09-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(0, 5)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_10.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_10.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(0, 5)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 5): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_10.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_10.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-10-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(1, 0)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_02.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_02.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(1, 0)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 0): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_02.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_02.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(1, 1)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_03.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_03.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(1, 1)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 1): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_03.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_03.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-03-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(1, 2)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_04.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_04.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(1, 2)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 2): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_04.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_04.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1080, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-04-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(1, 3)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_08.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_08.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(1, 3)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 3): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_08.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_08.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-08-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(1, 4)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_09.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_09.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(1, 4)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 4): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_09.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_09.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1080, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-09-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(1, 5)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_10.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_10.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(1, 5)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 5): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_10.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_10.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-10-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(2, 0)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_02.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_02.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(2, 0)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (2, 0): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_02.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_02.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    sossheig      (time_counter, y, x) float32 ...
    votemper      (time_counter, y, x) float32 ...
    vosaline      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean T grid variables in INALT60 region
    title:                     ocean T grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:28:56 CEST
    history:                   Tue Apr 10 18:24:54 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(2, 1)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_03.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_03.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(2, 1)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (2, 1): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_03.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_03.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-03-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    sossheig      (time_counter, y, x) float32 ...
    votemper      (time_counter, y, x) float32 ...
    vosaline      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean T grid variables in INALT60 region
    title:                     ocean T grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:28:56 CEST
    history:                   Tue Apr 10 18:24:54 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(2, 2)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_04.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_04.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(2, 2)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (2, 2): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_04.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_04.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1080, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-04-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    sossheig      (time_counter, y, x) float32 ...
    votemper      (time_counter, y, x) float32 ...
    vosaline      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean T grid variables in INALT60 region
    title:                     ocean T grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:28:56 CEST
    history:                   Tue Apr 10 18:24:54 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(2, 3)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_08.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_08.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(2, 3)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (2, 3): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_08.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_08.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-08-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    sossheig      (time_counter, y, x) float32 ...
    votemper      (time_counter, y, x) float32 ...
    vosaline      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean T grid variables in INALT60 region
    title:                     ocean T grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:28:56 CEST
    history:                   Tue Apr 10 18:24:54 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(2, 4)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_09.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_09.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(2, 4)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (2, 4): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_09.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_09.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1080, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-09-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    sossheig      (time_counter, y, x) float32 ...
    votemper      (time_counter, y, x) float32 ...
    vosaline      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean T grid variables in INALT60 region
    title:                     ocean T grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:28:56 CEST
    history:                   Tue Apr 10 18:24:54 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching input '(2, 5)'
pangeo_forge_recipes.storage - INFO - Caching file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_10.nc'
pangeo_forge_recipes.storage - INFO - Coping remote file 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_10.nc' to cache
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Caching metadata for input '(2, 5)'
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (2, 5): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_10.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_hts_10.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1116, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-10-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    sossheig      (time_counter, y, x) float32 ...
    votemper      (time_counter, y, x) float32 ...
    vosaline      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean T grid variables in INALT60 region
    title:                     ocean T grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:28:56 CEST
    history:                   Tue Apr 10 18:24:54 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Creating a new dataset in target
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening inputs for chunk (0, 0)
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (0, 0): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_02.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_u_02.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Combining inputs for chunk '(0, 0)'
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 dask.array<chunksize=(1020, 738, 509), meta=np.ndarray>
    nav_lon       (x) float32 dask.array<chunksize=(509,), meta=np.ndarray>
    nav_lat       (y) float32 dask.array<chunksize=(738,), meta=np.ndarray>
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable vozocrtx encoding chunks to (15, 738, 509)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable nav_lon encoding chunks to (509,)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable nav_lat encoding chunks to (738,)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable time_counter encoding chunks to (15,)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable depth encoding chunks to ()
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Storing dataset in s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 1.571
Dimensions without coordinates: x, y
Data variables:
    vozocrtx      (time_counter, y, x) float32 dask.array<chunksize=(1020, 738, 509), meta=np.ndarray>
    nav_lon       (x) float32 11.52 11.54 11.56 11.57 ... 19.96 19.98 19.99
    nav_lat       (y) float32 -40.49 -40.47 -40.46 ... -30.54 -30.53 -30.51
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean U grid variables in INALT60 region
    title:                     ocean U grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:04 CEST
    history:                   Tue Apr 10 18:14:46 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening inputs for chunk (1, 0)
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Opening input with Xarray (1, 0): 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_02.nc'
pangeo_forge_recipes.storage - INFO - Opening 'https://data.geomar.de/downloads/20.500.12085/0e95d316-f1ba-47e3-b667-fc800afafe22/data/INALT60_4h_surface_v_02.nc' from cache
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 ...
    nav_lon       (x) float32 ...
    nav_lat       (y) float32 ...
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Combining inputs for chunk '(1, 0)'
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 ...
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 dask.array<chunksize=(1020, 738, 509), meta=np.ndarray>
    nav_lon       (x) float32 dask.array<chunksize=(509,), meta=np.ndarray>
    nav_lat       (y) float32 dask.array<chunksize=(738,), meta=np.ndarray>
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable vomecrty encoding chunks to (15, 738, 509)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable nav_lon encoding chunks to (509,)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable nav_lat encoding chunks to (738,)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable time_counter encoding chunks to (15,)
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - Setting variable depth encoding chunks to ()
pangeo_forge_recipes.recipes.xarray_zarr - INFO - Storing dataset in s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr
pangeo_forge_recipes.recipes.xarray_zarr - DEBUG - <xarray.Dataset>
Dimensions:       (time_counter: 1020, x: 509, y: 738)
Coordinates:
  * time_counter  (time_counter) datetime64[ns] 2012-02-01T02:00:00 ... 2017-...
    depth         float64 1.571
Dimensions without coordinates: x, y
Data variables:
    vomecrty      (time_counter, y, x) float32 dask.array<chunksize=(1020, 738, 509), meta=np.ndarray>
    nav_lon       (x) float32 11.52 11.53 11.55 11.57 ... 19.95 19.97 19.98
    nav_lat       (y) float32 -40.48 -40.47 -40.46 ... -30.53 -30.52 -30.51
Attributes:
    name:                      2_INALT60.L120-KRS0020_4h_20100101_20100105
    description:               ocean V grid variables in INALT60 region
    title:                     ocean V grid variables in INALT60 region
    Conventions:               CF-1.5
    production:                An IPSL model
    timeStamp:                 2018-Apr-10 14:29:08 CEST
    history:                   Tue Apr 10 18:14:43 2018: ncrcat /gfs2/work/sh...
    NCO:                       4.4.8
    nco_openmp_thread_number:  1
---------------------------------------------------------------------------
GroupNotFoundError                        Traceback (most recent call last)
<ipython-input-10-bccf9db82171> in <module>
      2     try:
----> 3         r.open_target()
      4         print(f"found {recipe_key}")

/srv/conda/envs/notebook/lib/python3.8/site-packages/pangeo_forge_recipes/recipes/xarray_zarr.py in open_target(self)
    398         target_mapper = self.target.get_mapper()
--> 399         return xr.open_zarr(target_mapper)
    400 

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_zarr(store, group, synchronizer, chunks, decode_cf, mask_and_scale, decode_times, concat_characters, decode_coords, drop_variables, consolidated, overwrite_encoded_chunks, chunk_store, storage_options, decode_timedelta, use_cftime, **kwargs)
    686 
--> 687     ds = open_dataset(
    688         filename_or_obj=store,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/api.py in open_dataset(filename_or_obj, engine, chunks, cache, decode_cf, mask_and_scale, decode_times, decode_timedelta, use_cftime, concat_characters, decode_coords, drop_variables, backend_kwargs, *args, **kwargs)
    495     overwrite_encoded_chunks = kwargs.pop("overwrite_encoded_chunks", None)
--> 496     backend_ds = backend.open_dataset(
    497         filename_or_obj,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_dataset(self, filename_or_obj, mask_and_scale, decode_times, concat_characters, decode_coords, drop_variables, use_cftime, decode_timedelta, group, mode, synchronizer, consolidated, consolidate_on_close, chunk_store, storage_options, lock)
    733         filename_or_obj = _normalize_path(filename_or_obj)
--> 734         store = ZarrStore.open_group(
    735             filename_or_obj,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_group(cls, store, mode, synchronizer, group, consolidated, consolidate_on_close, chunk_store, storage_options, append_dim, write_region, safe_chunks)
    337         else:
--> 338             zarr_group = zarr.open_group(store, **open_kwargs)
    339         return cls(

/srv/conda/envs/notebook/lib/python3.8/site-packages/zarr/hierarchy.py in open_group(store, mode, cache_attrs, synchronizer, path, chunk_store, storage_options)
   1165         elif not contains_group(store, path=path):
-> 1166             raise GroupNotFoundError(path)
   1167 

GroupNotFoundError: group not found at path ''

During handling of the above exception, another exception occurred:

GroupNotFoundError                        Traceback (most recent call last)
/srv/conda/envs/notebook/lib/python3.8/site-packages/pangeo_forge_recipes/recipes/xarray_zarr.py in prepare_target(self)
    212         try:
--> 213             ds = self.open_target()
    214             logger.info("Found an existing dataset in target")

/srv/conda/envs/notebook/lib/python3.8/site-packages/pangeo_forge_recipes/recipes/xarray_zarr.py in open_target(self)
    398         target_mapper = self.target.get_mapper()
--> 399         return xr.open_zarr(target_mapper)
    400 

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_zarr(store, group, synchronizer, chunks, decode_cf, mask_and_scale, decode_times, concat_characters, decode_coords, drop_variables, consolidated, overwrite_encoded_chunks, chunk_store, storage_options, decode_timedelta, use_cftime, **kwargs)
    686 
--> 687     ds = open_dataset(
    688         filename_or_obj=store,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/api.py in open_dataset(filename_or_obj, engine, chunks, cache, decode_cf, mask_and_scale, decode_times, decode_timedelta, use_cftime, concat_characters, decode_coords, drop_variables, backend_kwargs, *args, **kwargs)
    495     overwrite_encoded_chunks = kwargs.pop("overwrite_encoded_chunks", None)
--> 496     backend_ds = backend.open_dataset(
    497         filename_or_obj,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_dataset(self, filename_or_obj, mask_and_scale, decode_times, concat_characters, decode_coords, drop_variables, use_cftime, decode_timedelta, group, mode, synchronizer, consolidated, consolidate_on_close, chunk_store, storage_options, lock)
    733         filename_or_obj = _normalize_path(filename_or_obj)
--> 734         store = ZarrStore.open_group(
    735             filename_or_obj,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_group(cls, store, mode, synchronizer, group, consolidated, consolidate_on_close, chunk_store, storage_options, append_dim, write_region, safe_chunks)
    337         else:
--> 338             zarr_group = zarr.open_group(store, **open_kwargs)
    339         return cls(

/srv/conda/envs/notebook/lib/python3.8/site-packages/zarr/hierarchy.py in open_group(store, mode, cache_attrs, synchronizer, path, chunk_store, storage_options)
   1165         elif not contains_group(store, path=path):
-> 1166             raise GroupNotFoundError(path)
   1167 

GroupNotFoundError: group not found at path ''

During handling of the above exception, another exception occurred:

GroupNotFoundError                        Traceback (most recent call last)
<ipython-input-10-bccf9db82171> in <module>
      7         pl = r.to_pipelines()
      8         plan = executor.pipelines_to_plan(pl)
----> 9         executor.execute_plan(plan)

/srv/conda/envs/notebook/lib/python3.8/site-packages/rechunker/executors/python.py in execute_plan(self, plan, **kwargs)
     30 
     31     def execute_plan(self, plan: Task, **kwargs):
---> 32         plan()
     33 
     34 

/srv/conda/envs/notebook/lib/python3.8/site-packages/rechunker/executors/python.py in _execute_all(tasks)
     35 def _execute_all(tasks: Iterable[Task]) -> None:
     36     for task in tasks:
---> 37         task()

/srv/conda/envs/notebook/lib/python3.8/site-packages/pangeo_forge_recipes/recipes/xarray_zarr.py in prepare_target(self)
    260                             "ignore"
    261                         )  # suppress the warning that comes with safe_chunks
--> 262                         ds.to_zarr(target_mapper, mode="a", compute=False, safe_chunks=False)
    263 
    264         # Regardless of whether there is an existing dataset or we are creating a new one,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/core/dataset.py in to_zarr(self, store, chunk_store, mode, synchronizer, group, encoding, compute, consolidated, append_dim, region, safe_chunks)
   1920             encoding = {}
   1921 
-> 1922         return to_zarr(
   1923             self,
   1924             store=store,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/api.py in to_zarr(dataset, store, chunk_store, mode, synchronizer, group, encoding, compute, consolidated, append_dim, region, safe_chunks)
   1445     writer = ArrayWriter()
   1446     # TODO: figure out how to properly handle unlimited_dims
-> 1447     dump_to_store(dataset, zstore, writer, encoding=encoding)
   1448     writes = writer.sync(compute=compute)
   1449 

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/api.py in dump_to_store(dataset, store, writer, encoder, encoding, unlimited_dims)
   1121         variables, attrs = encoder(variables, attrs)
   1122 
-> 1123     store.store(variables, attrs, check_encoding, writer, unlimited_dims=unlimited_dims)
   1124 
   1125 

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in store(self, variables, attributes, check_encoding_set, writer, unlimited_dims)
    462             # there are variables to append
    463             # their encoding must be the same as in the store
--> 464             ds = open_zarr(self.ds.store, group=self.ds.path, chunks=None)
    465             variables_with_encoding = {}
    466             for vn in existing_variables:

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_zarr(store, group, synchronizer, chunks, decode_cf, mask_and_scale, decode_times, concat_characters, decode_coords, drop_variables, consolidated, overwrite_encoded_chunks, chunk_store, storage_options, decode_timedelta, use_cftime, **kwargs)
    685     }
    686 
--> 687     ds = open_dataset(
    688         filename_or_obj=store,
    689         group=group,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/api.py in open_dataset(filename_or_obj, engine, chunks, cache, decode_cf, mask_and_scale, decode_times, decode_timedelta, use_cftime, concat_characters, decode_coords, drop_variables, backend_kwargs, *args, **kwargs)
    494 
    495     overwrite_encoded_chunks = kwargs.pop("overwrite_encoded_chunks", None)
--> 496     backend_ds = backend.open_dataset(
    497         filename_or_obj,
    498         drop_variables=drop_variables,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_dataset(self, filename_or_obj, mask_and_scale, decode_times, concat_characters, decode_coords, drop_variables, use_cftime, decode_timedelta, group, mode, synchronizer, consolidated, consolidate_on_close, chunk_store, storage_options, lock)
    732 
    733         filename_or_obj = _normalize_path(filename_or_obj)
--> 734         store = ZarrStore.open_group(
    735             filename_or_obj,
    736             group=group,

/srv/conda/envs/notebook/lib/python3.8/site-packages/xarray/backends/zarr.py in open_group(cls, store, mode, synchronizer, group, consolidated, consolidate_on_close, chunk_store, storage_options, append_dim, write_region, safe_chunks)
    336             zarr_group = zarr.open_consolidated(store, **open_kwargs)
    337         else:
--> 338             zarr_group = zarr.open_group(store, **open_kwargs)
    339         return cls(
    340             zarr_group, consolidate_on_close, append_dim, write_region, safe_chunks

/srv/conda/envs/notebook/lib/python3.8/site-packages/zarr/hierarchy.py in open_group(store, mode, cache_attrs, synchronizer, path, chunk_store, storage_options)
   1164             raise ContainsArrayError(path)
   1165         elif not contains_group(store, path=path):
-> 1166             raise GroupNotFoundError(path)
   1167 
   1168     elif mode == 'w':

GroupNotFoundError: group not found at path ''

I'm now filling in the missing data (and variables) and will ping the thread again when that's complete.

@cisaacstern
Copy link
Member

cisaacstern commented May 27, 2021

@roxyboy, the previously missing INALT60 surface data (and variables) should now accessible via the same xarray dictionary provided in my earlier comment on this thread.

Here's a preview of the variables in each zarr store and their sizes
import s3fs
import zarr

endpoint_url = 'https://ncsa.osn.xsede.org'
fs_osn = s3fs.S3FileSystem(anon=True, client_kwargs={'endpoint_url': endpoint_url},)

url_base = "s3://Pangeo/pangeo-forge/swot_adac/INALT60/"
root_paths = [f"{url_base}{ds}.zarr" for ds in ("surf_ocean_4h", "surf_ocean_5d", "surf_flux_1d")]

vars_a = ["sossheig", "vomecrty", "vosaline", "votemper", "vozocrtx",]
vars_b = ["sohefldo", "sometauy", "sowaflup", "sozotaux",]

for r in root_paths:
    print(r)
    group = zarr.open_consolidated(fs_osn.get_mapper(r))
    
    variables = vars_a if "ocean" in r else vars_b
    
    for v in variables:
        group_info = group[v].info_items()
        print(f"""{group_info[0][1]}
        {group_info[-2]}
        {group_info[-1]}
        """)
s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_4h.zarr
/sossheig
        ('No. bytes', '9808763904 (9.1G)')
        ('Chunks initialized', '436/436')
        
/vomecrty
        ('No. bytes', '9808763904 (9.1G)')
        ('Chunks initialized', '436/436')
        
/vosaline
        ('No. bytes', '9808763904 (9.1G)')
        ('Chunks initialized', '436/436')
        
/votemper
        ('No. bytes', '9808763904 (9.1G)')
        ('Chunks initialized', '436/436')
        
/vozocrtx
        ('No. bytes', '9808763904 (9.1G)')
        ('Chunks initialized', '436/436')
        
s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_ocean_5d.zarr
/sossheig
        ('No. bytes', '327559824 (312.4M)')
        ('Chunks initialized', '15/15')
        
/vomecrty
        ('No. bytes', '327559824 (312.4M)')
        ('Chunks initialized', '15/15')
        
/vosaline
        ('No. bytes', '327559824 (312.4M)')
        ('Chunks initialized', '15/15')
        
/votemper
        ('No. bytes', '327559824 (312.4M)')
        ('Chunks initialized', '15/15')
        
/vozocrtx
        ('No. bytes', '327559824 (312.4M)')
        ('Chunks initialized', '15/15')
        
s3://Pangeo/pangeo-forge/swot_adac/INALT60/surf_flux_1d.zarr
/sohefldo
        ('No. bytes', '1634793984 (1.5G)')
        ('Chunks initialized', '73/73')
        
/sometauy
        ('No. bytes', '1634793984 (1.5G)')
        ('Chunks initialized', '73/73')
        
/sowaflup
        ('No. bytes', '1634793984 (1.5G)')
        ('Chunks initialized', '73/73')
        
/sozotaux
        ('No. bytes', '1634793984 (1.5G)')
        ('Chunks initialized', '73/73')

As in your helpful comment #29 (comment) (which I will now take a look at in detail), please do let me know if anything seems opaque or out-of-place with these stores. It's a learning process, but I think we're getting a lot closer already!

@roxyboy
Copy link
Author

roxyboy commented May 28, 2021

@roxyboy, the previously missing INALT60 surface data (and variables) should now accessible via the same xarray dictionary provided in my earlier comment on this thread.

@cisaacstern Yes, thank you :) Also, could you push the grid files for HYCOM50 (#29 ) and INALT60 (#26 )?

@cisaacstern
Copy link
Member

could you push the grid files for ... INALT60 ?

Yep! The INALT60 grid is now on OSN, as shown below.

Question: I note that the xarray.Dataset doesn't have any coordinates. Is that what you'd expect/prefer, or is there some further processing we can apply to make this grid even more useful to you?

import s3fs
import xarray as xr

endpoint_url = 'https://ncsa.osn.xsede.org'
fs_osn = s3fs.S3FileSystem(anon=True, client_kwargs={'endpoint_url': endpoint_url},)

url = "s3://Pangeo/pangeo-forge/swot_adac/INALT60/grid.zarr"
inalt60_grid = xr.open_zarr(fs_osn.get_mapper(url), consolidated=True)

print(inalt60_grid)
<xarray.Dataset>
Dimensions:       (x: 509, y: 738, z: 120)
Dimensions without coordinates: x, y, z
Data variables: (12/44)
    e1f           (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    e1t           (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    e1u           (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    e1v           (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    e2f           (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    e2t           (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    ...            ...
    tmask         (z, y, x) float64 dask.array<chunksize=(120, 738, 509), meta=np.ndarray>
    tmaskutil     (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    umask         (z, y, x) float64 dask.array<chunksize=(120, 738, 509), meta=np.ndarray>
    umaskutil     (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>
    vmask         (z, y, x) float64 dask.array<chunksize=(120, 738, 509), meta=np.ndarray>
    vmaskutil     (y, x) float64 dask.array<chunksize=(738, 509), meta=np.ndarray>

@roxyboy
Copy link
Author

roxyboy commented Jun 2, 2021

Yep! The INALT60 grid is now on OSN, as shown below.

Thanks a bunch!

Question: I note that the xarray.Dataset doesn't have any coordinates. Is that what you'd expect/prefer, or is there some further processing we can apply to make this grid even more useful to you?

The same issue exists also with eNATL60 where both are NEMO outputs. Maybe @lesommer or @auraoupa will have more to say but it seems that the staggered C-grid metadata isn't carried over to the netcdf files.

@roxyboy
Copy link
Author

roxyboy commented Jul 5, 2021

The data provider for INALT60 asked for password protection for their data on OSN. Sorry for the additional work but could we do this?

@cisaacstern
Copy link
Member

password protection for their data on OSN

@roxyboy, it is not possible to password-protect data on our OSN bucket, so we will have to find another bucket to write it to. i do not currently have credentials for another bucket, but i will ask others on our team if there's one we can use. given that ryan is on vacation, it's possible it may take up to a few weeks to identify a password-protectable bucket (though it could be faster).

in the interim, would you like me to delete all of the INALT60 data from OSN? or would you prefer that i leave it there until we've identified an alternative location for it? (it's up to you; from a technical perspective either is fine.)

@roxyboy
Copy link
Author

roxyboy commented Jul 6, 2021

@roxyboy, it is not possible to password-protect data on our OSN bucket, so we will have to find another bucket to write it to. i do not currently have credentials for another bucket, but i will ask others on our team if there's one we can use. given that ryan is on vacation, it's possible it may take up to a few weeks to identify a password-protectable bucket (though it could be faster).

in the interim, would you like me to delete all of the INALT60 data from OSN? or would you prefer that i leave it there until we've identified an alternative location for it? (it's up to you; from a technical perspective either is fine.)

Ok, thanks for the speedy updates. I'll ask the INALT60 crew and see what they'd prefer.

@cisaacstern
Copy link
Member

@sharkinsspatial, per #26 (comment) above, @roxyboy and I are in search of a password-protected cloud bucket (ideally on s3, but any should be fine), that we can write these INALT60 recipes to. Collectively, they represent about 550 GBs. (Around 50 for the surface data, and 500 for the interiors.)

Are you aware of any existing password-protected endpoints in the greater Pangeo Forge universe that fit these requirements?

@roxyboy
Copy link
Author

roxyboy commented Jul 7, 2021

in the interim, would you like me to delete all of the INALT60 data from OSN? or would you prefer that i leave it there until we've identified an alternative location for it? (it's up to you; from a technical perspective either is fine.)

The INALT60 crew are ok with keeping the data on OSN until we find a private bucket.

@cisaacstern
Copy link
Member

keeping the data on OSN until we find a private bucket.

Cool. In the interim, would you like me to write the INALT60 interior data to OSN? I have it queued and ready to go. I can also wait until we have the private bucket, but that may be a few weeks from now. It's looking like we will probably need Ryan to set that up.

@roxyboy
Copy link
Author

roxyboy commented Jul 7, 2021

Cool. In the interim, would you like me to write the INALT60 interior data to OSN? I have it queued and ready to go. I can also wait until we have the private bucket, but that may be a few weeks from now. It's looking like we will probably need Ryan to set that up.

I think we can wait for the interior data of INALT60 :)

@rabernat
Copy link
Contributor

I will check with the OSN folks about this.

@cisaacstern
Copy link
Member

Per Ryan's suggestion in #56 (comment), before this recipe is merged, I will rework it to avoid manipulating __defaults__.

@andersy005
Copy link
Member

pre-commit.ci autofix

@auraoupa
Copy link
Contributor

@andersy005 , did you mean to do these changes here #189 ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
swot-adac SWOT Adopt-a-Crossover Dataset
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants