When I train Swin-TransformerV2 on my own dataset, each image is 64×64 after preprocessing. However, I can't feed these images into the model directly; I get the following error:

What should I do to ensure I can continue training? Could you please provide some suggestions?
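
In case it helps narrow this down, here is a rough sketch of the size check I think is relevant. It assumes the usual Swin layout, which I have not verified against my exact config: patch size 4, four stages, 2× downsampling between stages, and a window that is clamped to the feature-map size when the map is smaller. The function name `swin_stage_report` and its parameters are made up for illustration; they are not part of the Swin codebase.

```python
def swin_stage_report(img_size, patch_size=4, window_size=8, num_stages=4):
    """Hypothetical helper: list (stage, resolution, window, ok) per stage.

    Assumptions (not taken from the Swin source): square inputs, the window
    is clamped to the feature-map side, and each stage transition halves
    the feature-map side via patch merging.
    """
    if img_size % patch_size != 0:
        raise ValueError("img_size must be divisible by patch_size")

    report = []
    res = img_size // patch_size  # feature-map side after patch embedding
    for stage in range(num_stages):
        win = min(window_size, res)  # clamp window to the feature map
        is_last = stage == num_stages - 1
        # Windows must tile the map; merging needs an even side length.
        ok = res % win == 0 and (is_last or res % 2 == 0)
        report.append((stage, res, win, ok))
        if not is_last:
            res //= 2
    return report


if __name__ == "__main__":
    for window in (7, 8):
        print(f"window_size={window}")
        for stage, res, win, ok in swin_stage_report(64, window_size=window):
            print(f"  stage {stage}: {res}x{res} map, window {win}, ok={ok}")
```

Under these assumptions, a 64×64 input gives feature maps of 16, 8, 4, and 2, so a window size of 8 tiles every stage while a window size of 7 does not. If that matches how the model behaves, the options would seem to be either building the model with an image size of 64 and a compatible window size, or resizing the images to the resolution the model was configured for.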