Product Evals in Three Simple Steps
A guide to building product evaluations for LLMs using three steps: labeling data, aligning evaluators, and running experiments.
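To make the three steps concrete, here is a minimal Python sketch of that loop — label a small sample by hand, check how well an LLM judge agrees with those labels, then use the aligned judge to score an experiment. This is an illustration under assumptions, not the post's actual code: the sample records and the `llm_judge` stub are hypothetical stand-ins for a real LLM-as-judge call.

```python
# Minimal sketch of the three-step eval loop: label, align, experiment.
# `llm_judge` and the example data below are hypothetical placeholders.
from dataclasses import dataclass


@dataclass
class Example:
    output: str        # the LLM response being evaluated
    human_label: str   # "pass" or "fail", assigned by hand in Step 1


def llm_judge(output: str) -> str:
    """Stand-in for an LLM-as-judge call; returns 'pass' or 'fail'."""
    return "fail" if "as an AI" in output else "pass"


# Step 1: label a small sample of real outputs.
labeled = [
    Example("The refund policy allows returns within 30 days.", "pass"),
    Example("As an AI, I cannot help with that.", "fail"),
    Example("Shipping takes 3-5 business days.", "pass"),
]

# Step 2: align the evaluator by measuring agreement with the human labels,
# iterating on the judge prompt until agreement is acceptable.
agreement = sum(llm_judge(ex.output) == ex.human_label for ex in labeled) / len(labeled)
print(f"judge-human agreement: {agreement:.0%}")

# Step 3: run an experiment, scoring outputs from two prompt variants
# with the aligned judge and comparing pass rates.
variant_outputs = {
    "prompt_a": ["Returns accepted within 30 days.", "As an AI, I cannot help with that."],
    "prompt_b": ["Returns accepted within 30 days with receipt.", "Shipping takes 3-5 business days."],
}
for variant, outputs in variant_outputs.items():
    pass_rate = sum(llm_judge(o) == "pass" for o in outputs) / len(outputs)
    print(f"{variant}: pass rate {pass_rate:.0%}")
```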
Eugene Yan is a Principal Applied Scientist at Amazon, building AI-powered recommendation systems and experiences. He shares insights on RecSys, LLMs, and applied machine learning, while mentoring and investing in ML startups.
185 articles from this blog
A principal engineer shares advice for new principal tech ICs, covering role definition, shifting responsibilities, and the importance of influence and communication.
Explores training a hybrid LLM-recommender system using Semantic IDs for steerable, explainable recommendations.
Explores challenges and methods for evaluating question-answering AI systems when processing long documents like technical manuals or novels.
A presentation on using Large Language Model (LLM) techniques to enhance Recommendation Systems (RecSys) and Search, from the AI Engineer World's Fair 2025.
A technical guide on building automated news agents using MCP, Amazon Q CLI, and tmux to generate daily news recaps from RSS feeds.
Argues that effective AI product evaluation requires a scientific, process-driven approach, not just adding LLM-as-judge tools.
Summary of a panel discussion at NVIDIA GTC 2025 on insights and lessons learned from building real-world LLM-powered applications.
Explores how large language models (LLMs) are transforming industrial recommendation systems and search, covering hybrid architectures, data generation, and unified frameworks.
A developer builds an AI-powered reading companion called Dewey, detailing its features, design, and technical implementation.
A guide on starting and running a weekly paper club for learning about AI/ML research papers and building a technical community.
A guide to setting up a new MacBook Pro for development with minimal tools, including OS tweaks, terminal setup, and essential software.
Key lessons from 2024 ML conferences on building effective machine learning systems, covering reward functions, trade-offs, and practical engineering advice.
Introduces AlignEval, an app for building and automating LLM evaluators, making the process easier and more data-driven.
Author judges a Weights & Biases hackathon focused on building LLM evaluation tools, discussing key considerations and project highlights.
A developer compares building a simple CRUD web app using FastAPI, FastHTML, Next.js, and SvelteKit to evaluate their features and developer experience.
A survey of using LLMs as evaluators (LLM-as-Judge) for assessing AI model outputs, covering techniques, use cases, and critiques.
A guide to designing a reliable and valid interview process for hiring machine learning and AI engineers, covering technical skills, data literacy, and interview structure.
Reflections on delivering the closing keynote at the AI Engineer World's Fair 2024, sharing lessons from a year of building with LLMs.
A summary of a talk on applying Large Language Models (LLMs) to build and deploy recommendation systems at scale, presented at Netflix's PRS workshop.