Eugene Yan

Eugene Yan is a Principal Applied Scientist at Amazon, building AI-powered recommendation systems and experiences. He shares insights on RecSys, LLMs, and applied machine learning, while mentoring and investing in ML startups.

https://eugeneyan.com

1/22/2026

AI machine learning recommendation systems LLMs applied science

Articles from this Blog

185 articles from this blog

5/26/2024 • EN

Prompting Fundamentals and How to Apply them Effectively

Explains core prompting fundamentals for effective LLM use, including mental models, role assignment, and practical workflow with examples.

large language models prompt engineering Evaluation

5/12/2024 • EN

What We've Learned From A Year of Building with LLMs

A practical guide sharing lessons learned from a year of building real-world applications with Large Language Models (LLMs).

large language models prompt engineering RAG

3/31/2024 • EN

Task-Specific LLM Evals that Do & Don't Work

A guide to effective and ineffective evaluation methods for LLMs on tasks like classification, summarization, and translation, including practical metrics.

classification Translation LLM Evaluation

2/25/2024 • EN

Don't Mock Machine Learning Models In Unit Tests

Explains why mocking ML models in unit tests is problematic and offers guidelines for effectively testing machine learning code.

Python Machine Learning software engineering

2/11/2024 • EN

How to Generate and Use Synthetic Data for Finetuning

Explores methods for generating synthetic data (distillation and self-improvement) and using it for pretraining, instruction-tuning, and preference-tuning of LLMs.

llm Finetuning Instruction Tuning

1/7/2024 • EN

Language Modeling Reading List (to Start Your Paper Club)

A curated reading list of fundamental language modeling papers with summaries, designed to help start a weekly paper club for learning and discussion.

llm Transformer Research

12/24/2023 • EN

Push Notifications: What to Push, What Not to Push, and How Often

Analyzes push notifications as a recommender system, discussing intent, personalization, timeliness, and user engagement challenges.

Machine Learning user engagement Personalization

11/5/2023 • EN

Out-of-Domain Finetuning to Bootstrap Hallucination Detection

Explores using out-of-domain data to improve LLM finetuning for detecting factual inconsistencies (hallucinations) in text summaries.

Machine Learning llm Finetuning

10/15/2023 • EN

Reflections on AI Engineer Summit 2023

Key takeaways from the AI Engineer Summit 2023, focusing on challenges in LLM deployment like evaluation methods and serving costs.

deployment llm AI Engineering

10/9/2023 • EN

AI Engineer 2023 Keynote - Building Blocks for LLM Systems

A summary of a keynote talk on essential building blocks for production LLM systems, covering evaluations, RAG, and guardrails.

production llm Retrieval Augmented Generation

9/3/2023 • EN

Evaluation & Hallucination Detection for Abstractive Summaries

Explores methods for evaluating abstractive text summaries and detecting hallucinations, covering key dimensions and metrics like NLI and QA.

LLM Evaluation Abstractive Summarization Hallucination Detection

8/13/2023 • EN

How to Match LLM Patterns to Problems

A guide to selecting the right LLM architectural patterns (like RAG, fine-tuning, caching) to solve common production challenges such as performance metrics and data constraints.

RAG Fine Tuning LLM Applications

7/30/2023 • EN

Patterns for Building LLM-based Systems & Products

A practical guide outlining seven key patterns for integrating Large Language Models (LLMs) into robust, production-ready systems and products.

llm caching RAG

6/11/2023 • EN

Obsidian-Copilot: An Assistant for Writing & Reflecting

A technical overview of Obsidian-Copilot, a prototype AI assistant for drafting and reflecting within the Obsidian note-taking app using retrieval-augmented generation.

obsidian Retrieval Augmented Generation Semantic Search

5/21/2023 • EN

Some Intuition on Attention and the Transformer

Explains the intuition behind the Attention mechanism and Transformer architecture, focusing on solving issues in machine translation and language modeling.

llm Deep Learning NLP

5/7/2023 • EN

Open-LLMs - A list of LLMs for Commercial Use

A curated list of open-source Large Language Models (LLMs) available for commercial use, including community-contributed updates and details.

Machine Learning open source large language models

4/30/2023 • EN

Interacting with LLMs with Minimal Chat

Explores user interfaces for LLMs that minimize text chat, using clicks and user context for more intuitive interactions.

user experience llm ui-design

4/23/2023 • EN

More Design Patterns For Machine Learning Systems

Explores essential design patterns for building efficient and maintainable machine learning systems in production, focusing on data pipelines and best practices.

Machine Learning software engineering design patterns

4/16/2023 • EN

Raspberry-LLM - Making My Raspberry Pico a Little Smarter

A developer explores running LLMs on a Raspberry Pi Pico with memory constraints, creating a witty e-ink display that generates content from news feeds.

llm iot raspberry pi

4/9/2023 • EN

Experimenting with LLMs to Research, Reflect, and Plan

A developer shares experiments building LLM-powered tools for research, reflection, and planning, including URL summarizers, SQL agents, and advisory boards.

llm Langchain AI Assistant
