Deploy Mixtral 8x7B on AWS Inferentia2 with Hugging Face Optimum
A technical guide on deploying the Mixtral 8x7B LLM on AWS Inferentia2 using Hugging Face Optimum and Amazon SageMaker.
A technical guide on deploying the Mixtral 8x7B LLM on AWS Inferentia2 using Hugging Face Optimum and Amazon SageMaker.
Explains HSTS (HTTP Strict Transport Security), its role in ASP.NET Core apps for enforcing HTTPS, and configuration considerations.
A CSS expert discusses modern layout features like Grid, anchor positioning, and masonry, and why adoption of new CSS is slower than expected.
An update on Meridian, a Swift on the Server web framework inspired by SwiftUI, covering async/await support and WebSocket integration.
A guide to fixing cropped Mermaid diagrams in Obsidian by adjusting CSS styles for proper diagram display.
A tutorial on implementing a local RAG system using Phi-3, Semantic Kernel, and TextMemory in a C# console application.
A technical guide on implementing per-user discounts for a Shopify store, inspired by Bungie's system, using a custom app and database.
Learn how to change your Mac's system date from the command line for testing date-based logic in applications.
A developer shares their curated list of must-watch technical sessions from Apple's WWDC24, focusing on Swift, Xcode, StoreKit, and UI frameworks.
An introduction to Valibot, a new modular data validation library for JavaScript, exploring its origins and core concepts.
Explains how Entity Framework protects against SQL injection in LINQ queries but warns of risks when using raw SQL, highlighting safer alternatives.
A tutorial on implementing loading states in React forms using actions, including the useActionState hook.
Explains how to display file modification times in C++ using OS-specific APIs and the modern C++20 std::filesystem/std::chrono approach.
Explains the limitations of Large Language Models (LLMs) and introduces Retrieval Augmented Generation (RAG) as a solution for incorporating proprietary data.
A developer's reflections on WWDC 2024, exploring new Apple technologies like SwiftData, AI features, Sequoia, and Reality Kit for future projects.
Analysis of React 19's Suspense changes causing unexpected waterfalls and parallel loading issues, based on discoveries from the React Summit conference.
A podcast episode discussing the value of a developer's time and the balance between offering free services and charging for work.
Analyzes public reactions to AI bias claims, contrasting them with responses to traditional software bugs, using a viral example.
A tutorial on the six most fundamental R functions for data cleaning, using the tidyverse and palmerpenguins dataset.
Final part of a series on building enterprise-grade AI chat applications, covering data customization, UI, security, evaluation, scaling, load testing, and monitoring.