Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

enable compression #1085

Merged
merged 5 commits into from
Jul 27, 2022
Merged

enable compression #1085

merged 5 commits into from
Jul 27, 2022

Conversation

ksunden
Copy link
Member

@ksunden ksunden commented Jul 19, 2022

Changes

Enable passing compression/compression opts/shuffle arguments to h5py require_dataset

Initially this is posted to allow testing in bluesky-in-a-box, so draft without any tests/docs added yet

Closes #422

Checklist

  • added tests, if applicable
  • updated documentation, if applicable
  • updated CHANGELOG.md
  • tests pass

@ksunden
Copy link
Member Author

ksunden commented Jul 19, 2022

Dataset is (51, 61, 551) with array detector for the last axis, all other datasets are just 51x61

original file (without any compression): 34 MB
repacked: 29 MB
repacked and compressed: 1.8 MB

compression while collecting: 7.0 MB
repacked from compression while collecting (still compressed): 2.0 MB

@@ -31,6 +33,13 @@
d.create_channel(name="w1")


def test_create_compressed_channel():
data = wt.Data()
child1 = data.create_channel("hi", shape=(1024, 1024), compression="gzip")

Check notice

Code scanning / CodeQL

Unused local variable

The value assigned to local variable 'child1' is never used.
@ksunden ksunden marked this pull request as ready for review July 26, 2022 22:32
Copy link
Contributor

@kameyer226 kameyer226 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Win10 checks out

The test script needs ability to verify data integrity between uncompressed and compressed files.

@ksunden
Copy link
Member Author

ksunden commented Jul 27, 2022

honestly I'm not interested in testing data integrity, that is h5py's job, which I trust them greatly to do... The test is simply proving that it is indeed applied to begin with.

@kameyer226
Copy link
Contributor

ok

@kameyer226 kameyer226 merged commit f86bf00 into master Jul 27, 2022
@kameyer226 kameyer226 deleted the compression_test branch July 27, 2022 21:26
@ksunden ksunden restored the compression_test branch August 2, 2022 21:30
@ddkohler ddkohler deleted the compression_test branch March 25, 2024 18:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

dataset compression
3 participants