Quarter 2: ML Infrastructure

- Studied MLOps: experiment tracking (W&B), model serving (TorchServe, TGI)
- Learned about GPU profiling, memory optimization, quantization
- Built an internal tool for experiment comparison at current company
- Milestone: Reduced model serving latency by 40% at work