Sotabase
Home
Researchers
Career
·
Associate Professor
,
University of Washington, Paul G. Allen School of Computer Science & Engineering
2023–
Publications
(105)
Foreshadow: Extracting the Keys to the Intel SGX Kingdom with Transient Out-of-Order Execution
USENIX Security Symposium · 2018
1,145
cited
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Conference on Machine Learning and Systems · 2023
249
cited
Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
International Conference on Machine Learning · 2024
237
cited
Foreshadow-NG: Breaking the virtual memory abstraction with transient out-of-order execution
2018
195
cited
NDA: Preventing Speculative Execution Attacks at Their Source
Micro · 2019
143
cited
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
Conference on Machine Learning and Systems · 2025
139
cited
Data races vs. data race bugs: telling the difference with portend
ASPLOS XVII · 2012
125
cited
RaceMob: crowdsourced data race detection
Symposium on Operating Systems Principles · 2013
107
cited
REPT: Reverse Debugging of Failures in Deployed Software
USENIX Symposium on Operating Systems Design and Implementation · 2018
101
cited
DOLMA: Securing Speculation with the Principle of Transient Non-Observability
USENIX Security Symposium · 2021
92
cited
Cntr: Lightweight OS Containers
USENIX Annual Technical Conference · 2018
91
cited
Failure sketching: a technique for automated root cause diagnosis of in-production failures
Symposium on Operating Systems Principles · 2015
90
cited
I4: incremental inference of inductive invariants for verification of distributed protocols
Symposium on Operating Systems Principles · 2019
82
cited
NanoFlow: Towards Optimal Large Language Model Serving Throughput
arXiv.org · 2024
76
cited
A Hypervisor for Shared-Memory FPGA Platforms
International Conference on Architectural Support for Programming Languages and Operating Systems · 2020
69
cited
Morpheus: A Vulnerability-Tolerant Secure Architecture Based on Ensembles of Moving Target Defenses with Churn
International Conference on Architectural Support for Programming Languages and Operating Systems · 2019
61
cited
I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing
Micro · 2020
48
cited
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models
International Conference on Learning Representations · 2024
47
cited
Lazy Diagnosis of In-Production Concurrency Bugs
Symposium on Operating Systems Principles · 2017
45
cited
MOESI-prime: preventing coherence-induced hammering in commodity workloads
International Symposium on Computer Architecture · 2022
44
cited
Show all 105 papers →
Sotabase