I am a PhD candidate in Computer Science at the University of Illinois at Urbana-Champaign, advised by Ravishankar K. Iyer. I am affiliated with the DEPEND research group at the Coordinated Science Laboratory. My research aims to improve the performance, resilience, and ease of management of large-scale heterogeneous computer systems.
I am particularly interested applying machine learning to challenging problems within the design and management of computer systems. My research has demonstrated the application of domain-knowledge (i.e., knowledge about software and hardware architecture) as an inductive bias to ML techniques to solve a variety of systems problems, including heterogeneous resource management, distributed failure detection, and resilience assessment.
In past summers I have interned at IBM Research, Microsoft Research, and CERN.
News [More Entries]
- Oct 20, 2021 Our paper on characterizing latency variation in serverless FaaS has been accepted at WoSC 2021.
- Aug 20, 2021 Our paper on accelerating PairHMM computations on GPUs has been accepted at ICCD 2021.
- Nov 19, 2020 Our paper on correcting CPU-performance counter sampling errors has been accepted at ASPLOS 2021.
- Sep 5, 2020 Our SC 2020 paper has been nominated for the best paper and best student paper awards.
- Aug 15, 2020 Our paper on fine-grained ML-based resource rebalancing to meet microservice SLO-requirements has been accepted at OSDI 2020.
Selected Publications [Full List: Publications, Projects]
2021
2020
Live Forensics for HPC Systems: A Case Study on Distributed Storage Systems.
Supercomputing 2020.
- Best Paper & Best Student Paper Finalist
FIRM: An Intelligent Fine-Grained Resource Management Framework for SLO-Oriented Microservices.
OSDI 2020.
Inductive-bias-driven Reinforcement Learning for Efficient Schedules in Heterogeneous Clusters.
ICML 2020.
2019
ML-based Fault Injection for Autonomous Vehicles: A Case for Bayesian Fault Injection.
DSN 2019.
AcMC²: Accelerated Markov Chain Monte Carlo for Probabilistic Models.
ASPLOS 2019.
CAUDIT: Continuous Auditing of SSH-Servers To Mitigate Brute-Force Attacks.
NSDI 2019.
2018
ASAP: Accelerated Short Read Alignment on Programmable Hardware.
IEEE Transactions on Computers.