Production articles

6/4/2025 • EN

AI Engineer 2025 - Improving RecSys & Search with LLM techniques

A presentation on using Large Language Model (LLM) techniques to enhance Recommendation Systems (RecSys) and Search, from the AI Engineer World's Fair 2025.

AI Engineering llm production Recsys Search

Eugene Yan

3/18/2025 • EN

NVIDIA GTC 2025 - Building LLM-Powered Applications

Summary of a panel discussion at NVIDIA GTC 2025 on insights and lessons learned from building real-world LLM-powered applications.

Engineering generative ai llm Nvidia Gtc production

Eugene Yan

10/15/2024 • EN

Should you use uv's managed Python in production?

Analyzes the viability of using uv's managed Python in production, covering portability, performance, and security implications.

deployment packaging production Python uv

Itamar Turner Trauring

8/28/2024 • EN

Production-ready Python Docker Containers with uv

A guide to building fast, production-ready Docker containers for Python applications using the uv tool, focusing on multi-stage builds and caching strategies.

Containerization docker production Python uv

Hynek Schlawack

8/27/2024 • EN

Taking your AI application from concept to production

A guide on transitioning Generative AI applications from proof-of-concept to production, covering architecture, security, and operations.

Architecture Azure Openai generative ai Mlop production

Simon Waight

6/27/2024 • EN

AI Engineer 2024 Keynote - What We Learned from a Year of LLMs

Reflections on delivering the closing keynote at the AI Engineer World's Fair 2024, sharing lessons from a year of building with LLMs.

AI Engineering Keynote llm production software development

Eugene Yan

10/9/2023 • EN

AI Engineer 2023 Keynote - Building Blocks for LLM Systems

A summary of a keynote talk on essential building blocks for production LLM systems, covering evaluations, RAG, and guardrails.

AI Engineering Evaluations llm production Retrieval Augmented Generation

Eugene Yan

1/4/2023 • EN

Fixing a Memory Leak in a Production Node.js App

A developer details their journey to diagnose and fix a persistent memory leak in their production Node.js application after a database migration.

mdx-bundler memory leak Node.js production sqlite

Kent C. Dodds

5/15/2022 • EN

Ultimate Guide: NestJS Dockerfile For Production [2022]

A step-by-step tutorial on creating a production-optimized Dockerfile for NestJS applications, covering local testing and deployment.

docker dockerfile Nestjs Node.js production

Tom Ray

7/13/2021 • EN

SF Big Analytics - System Design for RecSys & Search

A talk on system design for recommendation and search systems, covering architecture and production considerations.

Machine Learning production Recsys Search system design

Eugene Yan

6/27/2021 • EN

How to Start a Production-Ready Django Project

A guide to setting up a new Django project with a focus on organization, environment separation, and production readiness.

configuration django Environments production project setup

Vitor Freitas

3/23/2021 • EN

Deploy A Site Live

Learn how to deploy a Django site live, including choosing a production-ready Python application server like Gunicorn.

deployment django production web server Wsgi

Matt Layman

7/5/2020 • EN

My Notes From Spark+AI Summit 2020 (Application-Specific Talks)

Notes from Spark+AI Summit 2020 covering application-specific talks on ML frameworks, data engineering, feature stores, and data quality from companies like Airbnb and Netflix.

Data Engineering Feature Engineering Machine Learning production Spark

Eugene Yan

5/25/2020 • EN

A tour of Django server setups

A guide to common Django server setups, from simple local development to professional production deployments with Gunicorn and PostgreSQL.

deployment django Gunicorn production Wsgi

Matt Segal

5/25/2020 • EN

A Practical Guide to Maintaining Machine Learning in Production

A guide to best practices for monitoring, maintaining, and managing machine learning models and data pipelines in a production environment.

Data Validation Machine Learning Mlop Monitoring production

Eugene Yan

5/18/2020 • EN

6 Little-Known Challenges After Deploying Machine Learning

Explores six unexpected challenges that arise after deploying machine learning models in production, from data schema changes to organizational issues.

Data Quality deployment Machine Learning Mlop production

Eugene Yan