I am a distributed systems engineer at Nvidia DGX Cloud, working on availability and scalability in data center automation.
Summary:
-
Interests:
During my PhD at the University of Maryland, College Park, I studied dynamic software updates for systems requiring high availability. My dream was (and is!) to eliminate all downtime in running systems.
My current interests are in optimizing and automating cloud infrastructure for AI and machine learning workloads. I help build solutions that improve scalability, efficiency, and availability, particularly in containerized and serverless systems.
-
Professional Experience:
Prior to joining Nvidia in October 2024, I was a Principal Research SDE in the Gray Systems Lab (GSL) at Microsoft. I have also worked on the Azure Kubernetes Service (AKS) infrastructure team and at Intel Labs as a research scientist. Before my PhD, I worked in the information security field in the Maryland/DC area.