Object Detection for Dummies Part 3: R-CNN Family
Explores the R-CNN family of models for object detection, covering R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN with technical details.
Lilian Weng is a machine learning researcher who publishes deep, well-researched notes on large language models, reinforcement learning, and generative AI. Her blog offers clear, structured insights into model reasoning, alignment, hallucinations, and modern ML systems.
50 articles from this blog
Explores classic CNN architectures for image classification, including AlexNet, VGG, and ResNet, as foundational models for object detection.
An introductory guide to the fundamental concepts of object detection, covering image gradients, HOG, and segmentation, as a precursor to deep learning methods.
Explains word embeddings, comparing count-based and context-based methods like skip-gram for converting words into dense numeric vectors.
Explores applying information theory, specifically the Information Bottleneck method, to analyze training phases and learning bounds in deep neural networks.
Explains the math behind GANs and their training challenges, and introduces WGAN as a solution for improved stability.
Explores the importance of interpreting ML model predictions, especially in regulated fields, and reviews interpretable models and methods, such as linear regression.
Part 2 of a tutorial on using RNNs with stock symbol embeddings to predict prices for multiple stocks.
A tutorial on building a Recurrent Neural Network (RNN) with LSTM cells in TensorFlow to predict S&P 500 stock prices.
An introduction to deep learning, explaining its rise, key concepts like CNNs, and why it's powerful now due to data and computing advances.