Notes on ‘AI Engineering’ chapter 9: Inference Optimisation
Summary of key concepts for optimizing AI inference performance, covering bottlenecks, metrics, and deployment patterns from Chip Huyen's book.
Summary of key concepts for optimizing AI inference performance, covering bottlenecks, metrics, and deployment patterns from Chip Huyen's book.
Updated LINQ MindMap for .NET 10, tracking new features and changes in the framework's query capabilities.
How to validate your ASP.NET Core Dependency Injection container configuration to catch errors early, ensuring all dependencies can be resolved.
Explores Azure's disaster recovery features for when an entire region is destroyed, covering availability zones, paired regions, and resilience strategies.
A web developer answers questions about blogging history, platform choices, writing process, and favorite posts.
A developer reflects on their blogging journey, platform choices, writing process, and favorite posts, touching on web development and career.
Explains the importance of specifying exact version numbers in Docker and package managers to ensure reproducible builds, stability, and security.
A comprehensive guide to Redux, covering its core principles, use cases, and a full practical example for state management in JavaScript/TypeScript apps.
How Amazon Q Developer simplifies AWS CloudWatch CLI commands by translating natural language queries into precise syntax.
A guide on improving blog post clarity by identifying and editing confusing sentences, focusing on making technical writing understandable.
A guide to creating a plaintext RSS feed using the 11ty static site generator, exploring its benefits for syndication and command-line readers.
A guide on splitting large React components into smaller, focused units to improve maintainability, reusability, and unit test coverage.
Analyzes the future of .NET Framework, estimating its support timeline based on Windows Server lifecycle and official Microsoft documentation.
A developer shares their experience finding and reporting 27 security vulnerabilities on Gitlab through HackerOne in 2019, including insights and motivations.
Explains how to cache CORS preflight requests in .NET applications using the Access-Control-Max-Age header to improve performance.
Explores using Bing Search API to ground LLM responses for website assistants, comparing custom implementation with Azure AI Agent Service.
A guide to automating Azure Quick Review (azqr) scans using a scheduled GitHub Actions workflow for continuous cloud resource assessment.
A summary and discussion of Chapter 1 of Chip Huyen's book, exploring the definition of AI Engineering, its distinction from ML, and the AI Engineering stack.
Explains how Azure's software-defined networking abstracts physical infrastructure, making virtual networks a conceptual layer rather than physical entities.
Explains how to use DebuggerDisplayAttribute to add debugging capabilities to third-party .NET types you can't modify directly.