Doctoral Student at InfiniAI lab, CMU ECE; Ex Research Fellow at MSR India
-
Carnegie Mellon University
- Pittsburgh, PA
- ranonrkm.github.io
Pinned Loading
-
DiskANN
DiskANN PublicForked from microsoft/DiskANN
Scalable graph based indices for approximate nearest neighbor search
C++ 1
-
-
Infini-AI-Lab/MagicDec
Infini-AI-Lab/MagicDec Public[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding
-
Infini-AI-Lab/Kinetics
Infini-AI-Lab/Kinetics PublicKinetics: Rethinking Test-Time Scaling Laws
-
Infini-AI-Lab/MagicDec-part1
Infini-AI-Lab/MagicDec-part1 PublicSpeculative decoding for high-throughput long-context inference
JavaScript
-
Infini-AI-Lab/MagicDec-part2
Infini-AI-Lab/MagicDec-part2 PublicMagicDec: Breaking the Latency-Throughput Tradeoff for Long Contexts with Speculative Decoding
JavaScript
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


