-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make Accumulators
and ScalarValue
serializable
#11369
Comments
Should we support serialization in substrait? Maybe extend it to serialize AggregateExec? datafusion/datafusion/substrait/src/physical_plan/producer.rs Lines 122 to 124 in 16a3148
|
The way we solved this was to add a sister method to the This may be something that could live in upstream if there is sufficient interest in it (it doesn't seem to overfit to any particular usage of DF). I will monitor this issue and we may patch this stuff to upstream DF if appropriate. |
we'd be very interested in this @ozankabak |
Not super familiar with the subtrait project but would this also serialize the state of the accumulators? |
I guess so. https://substrait.io/ There is also another issue for physical expr serialization |
Is your feature request related to a problem or challenge?
When running continuous computations, we'd like to snapshot the state of our operators many of which use accumulators. This would be key to make computations over continuous streams a first class citizen in DataFusion (see #11365 ).
Describe the solution you'd like
Addition of a
SerializableAccumulator
trait -as well as a method on the
Accumulator
trait -This would mean
ScalarValue
also needs to implement serialization to [u8]. We have a POC PR on our fork of DataFusion for this.Would love to hear feedback from the community on this proposal.
Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: