Sotabase
MS student in Electrical Engineering & Computer Sciences, UC Berkeley
Publications (5)
Squeezed Attention: Accelerating Long Context Length LLM Inference
Annual Meeting of the Association for Computational Linguistics · 2024
35 citations
ETS: Efficient Tree Search for Inference-Time Scaling
arXiv.org · 2025
10 citations
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
arXiv.org · 2025
Residual Context Diffusion Language Models
2026
TASER: Translation Assessment via Systematic Evaluation and Reasoning
Proceedings of the Tenth Conference on Machine Translation · 2025
Monishwaran Maheswaran | Researcher Profile | Sotabase