Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for I2V for LTX? #187

Closed
ArEnSc opened this issue Jan 6, 2025 · 2 comments
Closed

Support for I2V for LTX? #187

ArEnSc opened this issue Jan 6, 2025 · 2 comments

Comments

@ArEnSc
Copy link
Contributor

ArEnSc commented Jan 6, 2025

Feature request / 功能建议

How difficult would it be to support I2V training?
I am new to video transformers diffusion and more of a generalist LLM Engineer with Application Experience.

  1. I suspect it would be as simple as taking the input video which will be the target latent.

  2. The sample would be a copy of the latent.
    Mask out everything other than the first frame.
    Sample noise for the remaining frames.
    Add a binary mask to the latent.

  3. The diffusion transformer predicts the noise to be removed from the sample latent given the first frame using the target.

Does this sound correct?
Or do I have this wrong @sayakpaul @a-r-r-o-w

Motivation / 动机

Want to make a PR for this

Your contribution / 您的贡献

None as of yet still having conversations

@sayakpaul
Copy link
Collaborator

#150

@ArEnSc
Copy link
Contributor Author

ArEnSc commented Jan 6, 2025

That conversation was very enlightening there is some stuff I need to understand thanks~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants