Eugene Yan

Eugene Yan is a Principal Applied Scientist at Amazon, building AI-powered recommendation systems and experiences. He shares insights on RecSys, LLMs, and applied machine learning, while mentoring and investing in ML startups.

https://eugeneyan.com

RSS Feed

1/22/2026

AI machine learning recommendation systems LLMs applied science

Articles from this Blog

186 articles from this blog

8/23/2020 • EN

Embrace Beginner's Mind; Avoid The Wrong Way To Be An Expert

Article discusses the 'expert beginner' trap in tech, where narrow success halts learning, and advocates for maintaining a beginner's mindset.

Machine Learning programming learning

8/16/2020 • EN

NLP for Supervised Learning - A Brief Survey

A chronological survey of key NLP models and techniques for supervised learning, from early RNNs to modern transformers like BERT and T5.

Machine Learning Neural Networks Deep Learning

8/9/2020 • EN

Unpopular Opinion: Data Scientists Should be More End-to-End

Argues that data scientists should own the entire process from problem identification to solution deployment for greater impact and efficiency.

full stack Data Engineering Mlop

8/5/2020 • EN

Adding a Checkbox & Download Button to a FastAPI-HTML app

A tutorial on extending a FastAPI web app with HTML forms to add checkbox functionality and a file download button.

Web Development html forms fastapi

7/26/2020 • EN

Georgia Tech's OMSCS FAQ (based on my experience)

A graduate's detailed FAQ about Georgia Tech's Online Master's in Computer Science (OMSCS), covering costs, admissions, courses, and career impact.

software engineering computer science Online Education

7/23/2020 • EN

How to Set Up a HTML App with FastAPI, Jinja, Forms & Templates

A tutorial on building a web application with HTML forms and templates using the FastAPI framework and Jinja templating engine.

Web Development templates html forms

7/19/2020 • EN

Why You Need to Follow Up After Your Data Science Project

Explains the importance of post-project follow-up in data science, focusing on code cleanup, Jupyter notebook version control issues, and documentation.

git version control productivity

7/12/2020 • EN

What I Do During A Data Science Project To Deliver Success

A data scientist shares practical habits and workflows for executing successful data science projects, focusing on research, experimentation, and team alignment.

productivity execution project management

7/11/2020 • EN

How to Update a GitHub Profile README Automatically

A tutorial on automating GitHub profile README updates using Python and GitHub Actions to display recent blog posts.

Python automation rss

7/5/2020 • EN

My Notes From Spark+AI Summit 2020 (Application-Specific Talks)

Notes from Spark+AI Summit 2020 covering application-specific talks on ML frameworks, data engineering, feature stores, and data quality from companies like Airbnb and Netflix.

Machine Learning production Spark

6/28/2020 • EN

My Notes From Spark+AI Summit 2020 (Application-Agnostic Talks)

Summary of key application-agnostic talks from Spark+AI Summit 2020, focusing on scaling and optimizing deep learning models.

Machine Learning Apache Spark Deep Learning

6/21/2020 • EN

How to Set Up a Python Project For Automation and Collaboration

A guide to setting up a Python project with automated testing, linting, and type-checking to improve code quality and team collaboration.

Python testing code quality

6/21/2020 • EN

Mailbag: Qns on the Intersection of Data Science and Business

Answers common questions about data science in business, covering requirements, model interpretability, web scraping, and team roles.

Machine Learning web scraping Data Engineering

6/17/2020 • EN

Why Are My Airflow Jobs Running “One Day Late”?

Explains why Apache Airflow jobs appear to run a day late due to its scheduling logic, contrasting it with cron jobs.

Cron Scheduling Data Pipelines

6/15/2020 • EN

What I Do Before a Data Science Project to Ensure Success

A data scientist shares three essential pre-project tasks—the one-pager, time-box, and breakdown—to avoid common pitfalls and ensure project success.

productivity data analysis project management

6/7/2020 • EN

What I Love about Scrum for Data Science

A data scientist shares how adopting Scrum, despite initial resistance, improved project management and delivery for data science teams.

software development agile project management

5/25/2020 • EN

A Practical Guide to Maintaining Machine Learning in Production

A guide to best practices for monitoring, maintaining, and managing machine learning models and data pipelines in a production environment.

Machine Learning production Monitoring

5/18/2020 • EN

6 Little-Known Challenges After Deploying Machine Learning

Explores six unexpected challenges that arise after deploying machine learning models in production, from data schema changes to organizational issues.

Machine Learning deployment production