-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FR: TAE for SVD #14
Comments
Since each frame after the first should only be a difference from the previous frame (not an entirely new frame), is it viable to first decode the first frame with the original SVD vae (should only take about the same memory/speed as undistilled regular SD vae in this context) then decode the rest on a vae that is specifically trained on the difference between the current and previous frame in the latents, outputting a difference map to be applied to the last decoded frame? Kinda like a video codec. Moves away from the simplicity of decoding normally though |
Hi @madebyollin, thanks for your work! I am curious to know if there are any updates regarding the TAESVD model.
Hi @Ednaordinary, do you find some alternative ways or repos to speed up the decoding process for SVD? |
I've uploaded my initial TAESDV checkpoint + code to https://github.com/madebyollin/taesdv. It's still a bit WIP (see the TODOs in the README) but it should be capable of decoding much smoother videos than single-frame TAESD (while still being really fast). |
SVD can now get to really fast speeds step wise but is limited by the slow speed of the vae. Any way to distill the temporal spatial auto encoder the same way as the regular auto encoder?
The text was updated successfully, but these errors were encountered: