I have been trying to find a way to run two instances of skynet which query the same vllm instance accross a docker network.
The idea is that one instance runs vllm, the other doesnt run vllm but querys vllm running on the other instance.
I can setup the docker instances and docker network so that one docker instance can see the instance with vllm running on port 8003 accross the network but I cant get the fastapi endpoints like /summaries to initiate on the instance that vllm is not running on.
Is there some way that I can acheive this?