I am working on a project that uses multiple scenes, each with its own audio. When I generated the video, only the first scene kept its audio; the audio I added to the other scenes was ignored. Is this normal?