Why We Think
Explores how increasing 'thinking time' and Chain-of-Thought reasoning improves AI model performance, drawing parallels to human psychology.
Lilian Weng is a machine learning researcher who publishes deep, well-researched notes on large language models, reinforcement learning, and generative AI. Her blog offers clear, structured insights into model reasoning, alignment, hallucinations, and modern ML systems.
50 articles from this blog
Explores reward hacking in reinforcement learning, where AI agents exploit reward function flaws, and its critical impact on RLHF and language model alignment.
Explores the causes and types of hallucinations in large language models, focusing on extrinsic hallucinations and how training data affects factual accuracy.
Explores the application of diffusion models to video generation, covering technical challenges, parameterization, and sampling methods.
Explores the importance of high-quality human-annotated data for training AI models, covering task design, rater selection, and the wisdom of the crowd.
Explores adversarial attacks and jailbreak prompts that can make large language models produce unsafe or undesired outputs, bypassing safety measures.
An overview of LLM-powered autonomous agents, covering their core components like planning, memory, and tool use for complex problem-solving.
An overview of prompt engineering techniques for large language models, including zero-shot and few-shot learning methods.
An updated, comprehensive overview of the Transformer architecture and its many recent improvements, including detailed notation and attention mechanisms.
Explores techniques to optimize inference speed and memory usage for large transformer models, including distillation, pruning, and quantization.
A deep dive into Neural Tangent Kernel (NTK) theory, explaining the math behind why wide neural networks trained with gradient descent converge.
Explores methods for extending pre-trained language models to process visual information, focusing on four approaches for vision-language tasks.
Explores synthetic data generation methods, such as data augmentation and generation with pre-trained models, for overcoming limited training data in machine learning.
Explores active learning strategies for selecting the most valuable data to label when working with a limited labeling budget in machine learning.
Explores semi-supervised learning techniques for training models when labeled data is scarce, focusing on combining labeled and unlabeled data.
Explores parallelism techniques and memory optimization strategies for training massive neural networks across multiple GPUs.
An in-depth technical explanation of diffusion models, a class of generative AI models that create data by reversing a noise-adding process.
Explains contrastive representation learning, its objectives like contrastive and triplet loss, and its use in supervised and unsupervised machine learning.
Explores the challenge of defining and reducing toxic content in large language models, discussing categorization and safety methods.
Explores methods for controlling attributes like topic and style in neural text generation using decoding strategies, prompt design, and fine-tuning.