May be a bug?

https://github.com/THUDM/CogVideo/blob/main/finetune/models/cogvideox_t2v/lora_trainer.py#L111

Why `first_frame.repeat(1, 1, ncopy, 1, 1)`? 

I think that it should be repeating `patch_size_t - ncopy` times so that the `assert latent.shape[2] % patch_size_t == 0` can be realized. Here the patch_size_t is set to 2, so it may not raise an error. But if it is 4, and the latent't frame is 13, then it causes error.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

May be a bug? #782

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

May be a bug? #782

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions