I saw that MoriEP supports intra-node and inter-node ops, and I have a few questions:
- MoriEP is equivalent to which mode in DeepEP:
throughput or low_latency mode?
- Is MoriEP beneficial in the case no non-PD inferencing?
- In PD inferencing, is MoriEP suitable to be used in Prefill instance , Decode instance or both?