Fix shape descriptions in calculate_loss method #1204

yuta0x89 · 2024-01-09T08:30:34Z

The content of latents is defined in the following code in modeling_sd_base.py:

latents = self.prepare_latents(
    batch_size * num_images_per_prompt,
    num_channels_latents,
    height,
    width,
    prompt_embeds.dtype,
    device,
    generator,
    latents,
)

As a result, the shape of latents and next_latents mentioned in the description of the calculate_loss method in the DDPOTrainer class should be [batch_size, num_channels_latents, height, width].

lvwerra · 2024-01-09T13:12:09Z

Letting @sayakpaul have a look too.

Fix shape descriptions in calculate_loss method

9aa33e2

lvwerra requested a review from kashif January 9, 2024 13:11

sayakpaul approved these changes Jan 9, 2024

View reviewed changes

kashif approved these changes Jan 9, 2024

View reviewed changes

kashif merged commit b181e40 into huggingface:main Jan 9, 2024
1 check failed

lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024

Fix shape descriptions in calculate_loss method (huggingface#1204)

29cf6c8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix shape descriptions in calculate_loss method #1204

Fix shape descriptions in calculate_loss method #1204

yuta0x89 commented Jan 9, 2024

lvwerra commented Jan 9, 2024

Fix shape descriptions in calculate_loss method #1204

Fix shape descriptions in calculate_loss method #1204

Conversation

yuta0x89 commented Jan 9, 2024

lvwerra commented Jan 9, 2024