LLM Model Serving on Autopilot
A guide to deploying and running your own LLM on Google Kubernetes Engine (GKE) Autopilot for control, privacy, and cost management.
Explains why creating a new branch for pull requests, instead of using main, prevents workflow issues and conflicts in Git.
A guide on customizing comment colors in VS Code themes using token color customizations in settings.json.
How to cordon and drain nodes in GKE Autopilot for node replacement, including a disruptive cluster-wide method.
A recap of Java's major developments in 2023, including JDK 21, virtual threads, and improved developer tooling and accessibility.
A technical tutorial on fine-tuning the Llama 2 7B large language model using AWS Trainium instances and Hugging Face libraries.
Explains new and existing CSS techniques for vertically centering elements, including upcoming browser support for simpler methods.
Explores using multimodal vision AI models like LLaVA for advanced UI/UX test automation, moving beyond traditional methods.
A developer works on the sign-up template, styling, and feature flags for a SaaS product using Python and Django.
Explores the potential new 'params collections' feature in C# 13, allowing the params keyword to be used with collection types beyond arrays.
Learn how to create text outlines using pure CSS with the -webkit-text-stroke properties, including hover effects and browser support.
A guide to using React 18's useOptimistic hook for instant UI updates in Next.js applications, improving perceived performance.
Explains a common MySQL UPDATE error and provides a workaround using a derived table or jOOQ to bypass the restriction.
Guide to upgrading a GKE Autopilot cluster to version 1.28 to enable CUDA 12 support for NVIDIA GPU workloads.
Explains how to enable GKE Image Streaming for public Docker Hub images using a remote Artifact Registry repository as a mirror.
A guide to implementing quick, keyboard-first search in SwiftUI using the .searchable modifier, with platform-specific solutions for iOS and macOS.
A quick guide to finding the NVIDIA GPU driver version running on a Google Kubernetes Engine (GKE) cluster using a kubectl command.
Learn to programmatically manage Hugging Face Inference Endpoints using the huggingface_hub Python library for automated model deployment.
A technical guide on deploying Antora documentation using GitHub Actions with authentication for private GitHub repositories.
A software engineer's chronological career retrospective, detailing experiences with COBOL, C#, VR, and modern DevOps across various European tech roles.