Phi-3 4k model include in all responses the end token "<|end|>" Im using: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf and llama.cpp for docker cuda server in the latest version. Thanks in advance.