About

I design and build scalable AI infrastructure used by engineering teams at enterprise scale.

Most recently, I led development of a multi-cloud AI Gateway that lets 100+ teams securely access large language models across AWS, Azure, and GCP. It processes over 1 billion tokens per day, with built-in governance, resiliency, and cost controls.

Tech Stack

I have experience across various aspects of full-stack development. In my past roles, I worked on event-driven microservices built with TypeScript and AWS Lambda.

In my current role, I work extensively with LLM APIs, creating example applications and demonstrating how to integrate agentic RAG techniques into real-world use cases.

My work focuses on:

• Distributed systems and platform architecture
• Enterprise AI access and governance
• Reliability, observability, and cost optimization
• Standardizing infrastructure for cross-team adoption

I’m particularly interested in building internal platforms that simplify complexity and create long-term leverage for engineering organizations.

Technical Focus

I work primarily at the intersection of AI systems and cloud-native infrastructure.

AI & LLM Systems

Cloud & Infrastructure

Observability & Data

Application Layer

You can reach out to me via email.

View my Resume