2/7/2025
•
EN
Notes on ‘AI Engineering’ chapter 9: Inference Optimisation
Summary of key concepts for optimizing AI inference performance, covering bottlenecks, metrics, and deployment patterns from Chip Huyen's book.