I would like to verify whether the data preprocessing steps for MAE (Masked Autoencoder) feature extraction during inference are identical to those used during MAE pretraining. Could you clarify if the same preprocessing pipeline should be applied, or are there specific differences to consider for inference?