How difficult would it be to support I2V training?
Feature request
How difficult would it be to support I2V training?
I am new to video diffusion transformers; I am more of a generalist LLM engineer with application experience.
I suspect it would be as simple as:

1. Encode the input video; the encoded latent is the target.
2. The sample starts as a copy of the target latent.
3. Mask out everything other than the first frame.
4. Sample noise for the remaining frames.
5. Append a binary mask to the latent (e.g., as an extra channel) so the model knows which frames are conditioning.
6. The diffusion transformer predicts the noise to be removed from the sample latent, given the first frame, supervised against the target.

A rough sketch of this follows.
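To make this concrete, here is a minimal sketch of the batch preparation I am imagining. Everything in it is an assumption on my part, not this repo's actual I2V scheme: the `(B, F, C, H, W)` latent layout, the channel-wise concat, and the `prepare_i2v_batch` name are hypothetical; `scheduler.add_noise` refers to a diffusers-style scheduler.

```python
import torch
import torch.nn.functional as F


def prepare_i2v_batch(video_latents, scheduler, timesteps):
    """Hypothetical sketch of I2V input prep.

    video_latents: clean VAE latents, shape (B, F, C, H, W).
    scheduler: a diffusers-style scheduler exposing add_noise().
    timesteps: sampled diffusion timesteps, shape (B,).
    """
    b, f, c, h, w = video_latents.shape

    # Conditioning latent: a copy of the clean latent with every frame
    # except the first zeroed out (step 3 above).
    cond_latents = video_latents.clone()
    cond_latents[:, 1:] = 0.0

    # Binary mask channel: 1 on the conditioning (first) frame, 0 elsewhere,
    # so the model can tell conditioning content from noised content (step 5).
    mask = torch.zeros(
        b, f, 1, h, w, device=video_latents.device, dtype=video_latents.dtype
    )
    mask[:, 0] = 1.0

    # Noisy sample (step 4). This sketch noises all frames and lets the model
    # recover the first frame from the concatenated condition channels;
    # alternatively, the first frame could be left un-noised in the sample.
    noise = torch.randn_like(video_latents)
    noisy_latents = scheduler.add_noise(video_latents, noise, timesteps)

    # Channel-wise concat: [noisy sample | first-frame condition | mask],
    # giving the transformer 2C + 1 input channels.
    model_input = torch.cat([noisy_latents, cond_latents, mask], dim=2)
    return model_input, noise


# Usage (toy shapes; `transformer` and `text_embeds` are placeholders):
# scheduler = diffusers.DDPMScheduler()
# latents = torch.randn(2, 13, 16, 60, 90)
# t = torch.randint(0, scheduler.config.num_train_timesteps, (2,))
# model_input, noise = prepare_i2v_batch(latents, scheduler, t)
# loss = F.mse_loss(transformer(model_input, t, text_embeds), noise)
```

Whether the first frame stays clean in the noisy sample, or is noised and only recovered through the concatenated condition channels, looks like a design choice to me; the sketch does the latter.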
Does this sound correct?
Or do I have this wrong, @sayakpaul @a-r-r-o-w?
Motivation
I want to make a PR for this.
Your contribution
None as of yet; still having conversations.