AI Inference at Scale: Reliability, Observability, Cost, and Sustainability