added lazy loading option for nwb loading function #313

vigji · 2024-07-15T06:47:26Z

I was missing the option of changing the lazy loading behavior of NWB files when going through the function. This PR is a very minimal parameter piping from the function to the underlying NWBFile object instantiation.

A more general question about this function, for which I lack the broader context of where the package is going: it seems odd to have the same function for such different constructs. On the one side, NWB files are equivalent to whole collections of Tsds/TsdFrames/TsdTensors; on the other, npz files, from what I understand, each contain just one of those items and as a consequence lack a lazy_loading behavior. (That could be implemented by returning the object and not the object.load() output, but this would change the current default of load_file() either for NWB files, where it is true, or for npz files, where it is false.)

Let me know what you think about this!

gviejo · 2024-07-15T16:40:09Z

Reading your questions makes me think there should be a nap.load_nwb_file and a nap.load_npz_file. That would make things cleaner.
Would you be willing to write those functions in the PR and mark nap.load_file as deprecated?

gviejo · 2024-07-15T16:43:01Z

Alternatively, the lazy_loading argument could be propagated to np.load through the mmap_mode argument. It supports memory mapping, if I understand the documentation correctly.
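As a minimal sketch of this idea, assuming a boolean `lazy_loading` flag is simply translated into the `mmap_mode` argument of `np.load` (the `load_array` helper is hypothetical and not part of pynapple), the mapping could look like this for a single `.npy` file:

```python
import os
import tempfile

import numpy as np


def load_array(path, lazy_loading=False):
    """Load a .npy file, memory-mapping it when lazy_loading is True.

    Hypothetical helper for illustration only.
    """
    # mmap_mode="r" opens the array read-only without reading it all into
    # memory; mmap_mode=None reads the whole array eagerly.
    mmap_mode = "r" if lazy_loading else None
    return np.load(path, mmap_mode=mmap_mode)


# Write a small array to a temporary .npy file and load it both ways.
tmp_dir = tempfile.mkdtemp()
npy_path = os.path.join(tmp_dir, "data.npy")
np.save(npy_path, np.arange(10.0))

eager = load_array(npy_path, lazy_loading=False)
lazy = load_array(npy_path, lazy_loading=True)

print(type(eager).__name__, type(lazy).__name__)
```

This only demonstrates the behavior for a plain `.npy` file; whether the same argument has any effect on `.npz` archives needs to be checked separately.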

BalzaniEdoardo · 2024-07-15T18:43:29Z

I would vote for the second option ("mmap_mode") so that people do not have to remember two different commands for loading. I think it looks cleaner.

vigji · 2024-07-15T20:07:17Z

@BalzaniEdoardo I do not know your current preferred usage of the package (for nwbs or folders of npzs) but it is a bit strange to keep the same function to load such different things (either whole datasets or parts of them).

I don't mind particularly, and I'm happy to evolve this PR in either direction, but from a design POV what @gviejo proposed (split and deprecate) feels definitely cleaner.

gviejo · 2024-07-15T20:36:15Z

I think the best decision here is the one that gives us minimal work. I would choose to keep a single nap.load_file. NumPy's np.load does the same by taking either .npz or .npy.

vigji · 2024-07-15T21:05:36Z

Ok! What about the default of lazy mode then? True would change the current behavior for .npz, False would change the behavior for .nwb.

gviejo · 2024-07-16T17:52:25Z

Ok, so the default is None: by default, lazy loading is used for NWB and not for NPZ. Setting it to True or False forces the behavior.
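The rule described here can be sketched as a small resolver. This is only an illustration under stated assumptions: `resolve_lazy_loading` is a hypothetical name, and the file type is assumed to be decided by the extension alone.

```python
from pathlib import Path


def resolve_lazy_loading(path, lazy_loading=None):
    """Return the effective lazy-loading flag for a file path.

    Hypothetical helper: None means "use the per-format default",
    while True or False forces the behavior for any format.
    """
    if lazy_loading is not None:
        return bool(lazy_loading)

    suffix = Path(path).suffix.lower()
    if suffix == ".nwb":
        return True  # NWB files are lazy by default
    if suffix == ".npz":
        return False  # npz files are loaded eagerly by default
    raise ValueError(f"Unsupported file extension: {suffix!r}")


print(resolve_lazy_loading("session.nwb"))
print(resolve_lazy_loading("tsd.npz"))
print(resolve_lazy_loading("tsd.npz", lazy_loading=True))
```

Using None as the default keeps the existing behavior of both formats unchanged while still letting the caller override it explicitly.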

vigji · 2024-07-16T22:35:23Z

Ok!
Implemented the required behavior in load_file, but also added a refactoring proposal for NPZFile.
Here is a draft, if you can have a look. The idea is to have a lazy loader object for the npz file, which in principle could have been a drop-in replacement for the old class. However, I also noticed that the syntax for both the determination and, in particular, the instantiation was quite convoluted.

Everything can be streamlined considerably if we move the constructors from the file to be class methods of each respective object. I think this makes a lot of sense, as it binds the saving and the loading more tightly together. Does this look reasonable?

BalzaniEdoardo

This is a good idea, but I think having a class method for each object is too much. I think we should have a single method for all the time-series, implemented in Base.

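I added the code in the comments, but it should look something like this:

    @classmethod
    def _from_npz_reader(cls, file):
        # Everything except the time-support bounds and the type tag
        # is passed straight to the constructor.
        kwargs = {key: val for key, val in file.items() if key not in ["start", "end", "type"]}
        # Rebuild the time support from the saved start/end arrays.
        iset = nap.IntervalSet(start=file["start"], end=file["end"])
        return cls(time_support=iset, **kwargs)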

pynapple/core/time_series.py

BalzaniEdoardo

I think you should add tests to the npz loader to check that an npz file can actually be memory-mapped. My worry is that the memmap parameter of load will be ignored for npz, but this needs to be tested.
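A check along these lines could be sketched as follows. The `is_memmapped` helper is hypothetical; it only reports whether the array returned by `np.load(..., mmap_mode="r")` is an `np.memmap` instance, for a `.npy` file and for one entry of a `.npz` archive.

```python
import os
import tempfile

import numpy as np


def is_memmapped(path, key=None, mmap_mode="r"):
    """Return True if the array loaded from `path` is an np.memmap.

    Hypothetical helper: pass `key` to inspect one entry of a .npz archive.
    """
    loaded = np.load(path, mmap_mode=mmap_mode)
    if key is None:
        # Plain .npy file: np.load returns the array directly.
        return isinstance(loaded, np.memmap)
    # .npz archive: np.load returns an NpzFile; inspect the requested entry.
    with loaded:
        return isinstance(loaded[key], np.memmap)


# Save the same data in both formats in a temporary directory.
tmp_dir = tempfile.mkdtemp()
data = np.arange(100.0)
npy_path = os.path.join(tmp_dir, "data.npy")
npz_path = os.path.join(tmp_dir, "data.npz")
np.save(npy_path, data)
np.savez(npz_path, d=data)

npy_is_mapped = is_memmapped(npy_path)
npz_is_mapped = is_memmapped(npz_path, key="d")
print("npy memmapped:", npy_is_mapped)
print("npz memmapped:", npz_is_mapped)
```

In a test suite, the two printed values would become assertions, so that a silently ignored `mmap_mode` for `.npz` files shows up as a failing test rather than as unexpected memory use.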

Independently, I still think the classmethod for loading the time series is quite elegant, and can simplify the io.interface_npz.py.

All the interface has to do is figure out the type, and call the class method.
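The division of labor described above can be sketched with toy classes. Everything here is an assumption for illustration: `ToyBase`, `ToyTsd`, `ToyTsdFrame`, `REGISTRY` and `load_npz` are hypothetical stand-ins, not pynapple's actual classes or interface.

```python
import os
import tempfile

import numpy as np


class ToyBase:
    """Hypothetical stand-in for a shared time-series base class."""

    def __init__(self, t, d, start, end):
        self.t, self.d, self.start, self.end = t, d, start, end

    def save(self, path):
        # Store the class name under "type" so the loader can dispatch on it.
        np.savez(path, t=self.t, d=self.d, start=self.start, end=self.end,
                 type=np.array([type(self).__name__]))

    @classmethod
    def _from_npz_reader(cls, file):
        # One loader for every subclass: forward all data keys to __init__.
        kwargs = {key: file[key] for key in file.files
                  if key not in ("start", "end", "type")}
        return cls(start=file["start"], end=file["end"], **kwargs)


class ToyTsd(ToyBase):
    pass


class ToyTsdFrame(ToyBase):
    pass


REGISTRY = {cls.__name__: cls for cls in (ToyTsd, ToyTsdFrame)}


def load_npz(path):
    """Figure out the stored type, then delegate to the class method."""
    with np.load(path) as file:
        type_name = str(file["type"][0])
        return REGISTRY[type_name]._from_npz_reader(file)


# Round trip: save a toy object, then load it back through the dispatcher.
tmp_dir = tempfile.mkdtemp()
path = os.path.join(tmp_dir, "tsd.npz")
original = ToyTsd(t=np.arange(5.0), d=np.arange(5.0) * 2,
                  start=np.array([0.0]), end=np.array([4.0]))
original.save(path)
restored = load_npz(path)
print(type(restored).__name__)
```

Keeping `save` and `_from_npz_reader` on the same base class means the set of stored keys is defined in one place, and the interface module only needs the type lookup.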

pynapple/io/interface_npz.py

vigji · 2024-07-17T08:42:28Z

This should be done! Alas, no lazy mode for npz files. I really could not figure that out.

The interface looks cleaner now, so, it was not for nothing :)

A note: I am not sure to which degree there should be support for npz files without a type specification. If this is there for legacy files only, I would raise a warning and deprecate in the long term.
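A warning of this kind could be sketched as below. This is an assumption about how such a deprecation might be surfaced, not what the PR implements; `get_declared_type` is a hypothetical helper that accepts any mapping-like container of file contents.

```python
import warnings


def get_declared_type(contents):
    """Return the declared 'type' entry, warning when it is missing.

    Hypothetical helper: `contents` is any mapping of saved keys to values.
    """
    if "type" in contents:
        return str(contents["type"])
    warnings.warn(
        "File has no 'type' entry; support for untyped files is deprecated.",
        DeprecationWarning,
        stacklevel=2,
    )
    return None


# A legacy-style file without a type entry triggers the warning.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    legacy_type = get_declared_type({"t": [0.0, 1.0], "d": [1.0, 2.0]})

print(legacy_type, [str(w.message) for w in caught])
```

Returning None leaves room for the caller to fall back to whatever legacy inference it already has while the warning announces the planned removal.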


vigji · 2024-07-17T21:14:45Z

Ok, everything should be good now!

Let me know about that point on the deprecation; if you want, I can add the warning in this PR. Otherwise, it should be ready for merging.

gviejo · 2024-07-18T23:13:26Z

Thanks Luigi

BalzaniEdoardo · 2024-07-18T23:22:21Z

I think you are right about that, the type parameters should be deprecated and removed.


added lazy loading option for nwb loading function

f6b11d3

vigji requested a review from gviejo as a code owner July 15, 2024 06:47

vigji added 2 commits July 17, 2024 00:26

npz refactoring and lazy loading

c5f8cfb

small impros

358892e

vigji added 3 commits July 17, 2024 00:37

Merge remote-tracking branch 'upstream/dev'

769205e

blaked

c566989

fixed tests

9079ddc

BalzaniEdoardo requested changes Jul 16, 2024

pynapple/core/time_series.py

BalzaniEdoardo reviewed Jul 17, 2024

pynapple/io/interface_npz.py

vigji added 2 commits July 17, 2024 09:35

Moved class method to base class

295dfac

Final cleanup and load_file test in lazy_load

ae2aa3e

vigji added 2 commits July 17, 2024 10:43

Comment removed

d68f348

linting

d469536

vigji requested a review from BalzaniEdoardo July 17, 2024 09:03

vigji added 2 commits July 17, 2024 11:04

isorted black

ed2726a

Added option for closing an open file

5ed5b76

gviejo requested changes Jul 17, 2024

vigji and others added 4 commits July 17, 2024 22:57

removed file from test

804565e

ignore folder generated during tests

b302e7d

Update pynapple/io/interface_npz.py

1be64e3

Co-authored-by: Guillaume Viejo <[email protected]>

Update pynapple/io/interface_npz.py

dbea5ac

Co-authored-by: Guillaume Viejo <[email protected]>

vigji added 2 commits July 17, 2024 23:13

blacked

797bb0a

Merge branch 'main' of https://github.com/iurillilab/pynapple

7cd48f2

vigji mentioned this pull request Jul 17, 2024

from_npz() method for loading #169

Closed

gviejo approved these changes Jul 18, 2024

gviejo merged commit ef9dc1a into pynapple-org:dev Jul 18, 2024
11 checks passed
