Eugene Yan

Eugene Yan is a Principal Applied Scientist at Amazon, building AI-powered recommendation systems and experiences. He shares insights on RecSys, LLMs, and applied machine learning, while mentoring and investing in ML startups.

https://eugeneyan.com

1/22/2026

AI machine learning recommendation systems LLMs applied science

Articles from this Blog

185 articles from this blog

3/19/2023 • EN

LLM-powered Biographies

An experiment comparing how different large language models (GPT-4, Claude, Cohere) write a biography, analyzing their accuracy and training data.

LLM GPT-4 Claude

3/12/2023 • EN

How to Write Data Labeling/Annotation Guidelines

A guide on creating effective data labeling guidelines for machine learning, covering principles like Why, What, and How, with examples from Google and Bing.

Machine Learning Guidelines Data Annotation

2/26/2023 • EN

Content Moderation & Fraud Detection - Patterns in Industry

Explores five industry patterns for building robust content moderation and fraud detection systems using ML, including human-in-the-loop and data augmentation.

Machine Learning content moderation Supervised Learning

2/5/2023 • EN

Mechanisms for Effective Technical Teams

Explores practical team mechanisms like end-of-week debriefs and monthly learning sessions to boost productivity and collaboration in technical teams.

software development code review technical leadership

1/22/2023 • EN

Mechanisms for Effective Machine Learning Projects

Explores practical mechanisms like pilot/copilot roles and literature reviews to improve the success rate of machine learning projects.

Machine Learning code review project management

1/15/2023 • EN

Goodbye Roam Research, Hello Obsidian

A developer shares their experience migrating from Roam Research to Obsidian for note-taking, including steps, plugins, and syncing setup.

git productivity obsidian

1/8/2023 • EN

What To Do If Dependency Teams Can’t Help

Strategies for managing team dependencies in tech organizations when other teams can't provide support, focusing on understanding constraints and building trust.

software development project management leadership

12/24/2022 • EN

2022 in Review & 2023 Goals

A data scientist reviews his 2022 goals, including technical writing on ML topics and career progression, and sets new goals for 2023.

Python Machine Learning PyTorch

12/11/2022 • EN

Autoencoders and Diffusers: A Brief Comparison

Compares autoencoders and diffusers, explaining their architectures, learning paradigms, and key differences in deep learning.

Neural Networks Deep Learning Diffusion Models

11/27/2022 • EN

Text-to-Image: Diffusion, Text Conditioning, Guidance, Latent Space

Explains core concepts behind modern text-to-image AI models like DALL-E 2 and Stable Diffusion, including diffusion, text conditioning, and latent space.

Deep Learning Text To Image Diffusion Models

10/2/2022 • EN

RecSys 2022: Recap, Favorite Papers, and Lessons

A recap of the RecSys 2022 conference, highlighting key trends, favorite papers, and lessons learned in recommendation systems.

Graph Neural Networks Recommender Systems Sequential Recommendation

9/23/2022 • EN

RecSys 2022 Keynote - Is the Juice Worth the Squeeze?

A keynote exploring the trade-offs between batch and online recommender systems, with real-world examples from Amazon Books.

Recommender Systems Online Systems Batch Systems

9/4/2022 • EN

Writing Robust Tests for Data & Machine Learning Pipelines

Explores why tests for data and ML pipelines break when they shouldn't and offers strategies for writing more robust unit, schema, and integration tests.

Machine Learning unit testing software testing

8/14/2022 • EN

Simplicity is An Advantage but Sadly Complexity Sells Better

Explores why complex ideas and systems are often favored over simpler ones in tech and academia, and argues for the advantages of simplicity.

Machine Learning software development complexity

7/31/2022 • EN

Uncommon Uses of Python in Commonly Used Libraries

Explores advanced Python techniques like using super() in base classes for cooperative multiple inheritance, based on analysis of popular libraries.

Python super Multiple Inheritance

6/26/2022 • EN

Why You Should Write Weekly 15-5s

Explains the benefits of writing weekly 15-5 reports for productivity, visibility, and team trust in a tech/engineering context.

software development career development productivity

6/12/2022 • EN

Design Patterns in Machine Learning Code and Systems

Explores the application of classic software design patterns, like the Factory pattern, to machine learning code and systems, using examples from PyTorch, Gensim, and Hugging Face.

Python Machine Learning design patterns

5/22/2022 • EN

What I Wish I Knew About Onboarding Effectively

A senior tech professional shares practical guidelines and mindset strategies for effectively onboarding into a new mid-to-senior role in the tech industry.

software development productivity engineering culture

5/8/2022 • EN

Bandits for Recommender Systems

Explores bandit algorithms like ε-greedy, UCB, and Thompson Sampling to improve recommender systems by balancing exploration and exploitation.

Machine Learning Reinforcement Learning Recommender Systems

4/17/2022 • EN

How to Measure and Mitigate Position Bias

Explains position bias in recommendation systems and methods to measure and reduce its impact on user engagement and model fairness.

user engagement Data Bias Recommender Systems
