arxiv:2403.15042
Nicholas Lee
nicholaslee
AI & ML interests
NLP, Speech, Efficient ML, Generative
Recent Activity
upvoted a paper about 4 hours ago
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
upvoted a paper 4 months ago
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization