You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…
This project implements an efficient scheduling system for Large Language Model (LLM) inference, as described in the paper "Efficient LLM Scheduling by Learning to Rank"