How to Build an Open-Domain Question Answering System?
A technical overview of approaches for building open-domain question answering systems using pretrained language models and neural networks.
Lilian Weng is a machine learning researcher who writes deep, well-researched notes on large language models, reinforcement learning, and generative AI. Her blog offers clear, structured insights into model reasoning, alignment, hallucinations, and modern ML systems.
50 articles from this blog
An overview of Neural Architecture Search (NAS), covering its three core components: the search space, the search algorithm, and the evaluation strategy used to automate neural network design.
An overview of key exploration strategies in Deep Reinforcement Learning, including classic methods and modern approaches for tackling hard-exploration problems.
An updated overview of the Transformer model family, covering improvements for longer attention spans, efficiency, and new architectures since 2020.
Explores curriculum learning, a strategy for training reinforcement learning models more efficiently by ordering tasks from simple to complex.
Explores self-supervised learning, a method for training models on unlabeled data by constructing supervised pretext tasks from the data itself, covering key concepts and models.
An introduction to Evolution Strategies (ES) as a black-box optimization alternative to gradient descent, with applications in deep reinforcement learning.
Explores meta reinforcement learning, where agents learn to adapt quickly to new, unseen RL tasks, aiming for general-purpose problem-solving algorithms.
Explores domain randomization as a technique to bridge the simulation-to-reality gap in robotics and deep reinforcement learning.
Explores the paradox of why deep neural networks generalize well despite having many parameters, discussing theories like Occam's Razor and the Lottery Ticket Hypothesis.
A technical overview of the evolution of large-scale pre-trained language models like BERT, GPT, and T5, focusing on contextual embeddings and transfer learning in NLP.
Explores fast, one-stage object detection models like YOLO, SSD, and RetinaNet, comparing them to slower two-stage R-CNN models.
An introduction to meta-learning, often described as 'learning to learn': a machine learning approach where models adapt quickly to new tasks with minimal data.
An introduction to flow-based deep generative models, explaining how they explicitly learn data distributions using normalizing flows, compared to GANs and VAEs.
Explores the evolution from basic Autoencoders to Beta-VAE, covering their architecture, mathematical formulation, and applications in dimensionality reduction.
Explains the attention mechanism in deep learning, its motivation from human perception, and its role in improving seq2seq models, culminating in architectures like the Transformer.
A hands-on tutorial on implementing deep reinforcement learning models using TensorFlow and the OpenAI Gym environment.
A comprehensive overview of policy gradient algorithms in reinforcement learning, covering key concepts, notations, and various methods.
An introductory guide to Reinforcement Learning (RL), covering key concepts, algorithms like SARSA and Q-learning, and its role in AI breakthroughs.
Explores the Multi-Armed Bandit problem, a classic dilemma balancing exploration and exploitation in decision-making algorithms; a minimal epsilon-greedy sketch follows this list.
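To make the exploration-exploitation dilemma from the last entry concrete, here is a minimal epsilon-greedy sketch in Python. It is an illustrative assumption, not code from any of the posts above: the function name `epsilon_greedy_bandit` and the arm probabilities are hypothetical.

```python
import random

def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=1000, seed=0):
    """Epsilon-greedy on a Bernoulli multi-armed bandit (hypothetical example).

    With probability `epsilon`, explore a random arm; otherwise exploit
    the arm with the highest estimated mean reward so far.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms          # number of pulls per arm
    estimates = [0.0] * n_arms     # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a random arm
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])  # exploit
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean for the pulled arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, total_reward

# Three arms with hidden success probabilities; the agent must discover
# that the last arm pays off most often.
estimates, total = epsilon_greedy_bandit([0.2, 0.5, 0.7])
print(estimates, total)
```

With epsilon around 0.1, the agent spends most pulls on the best-looking arm while still sampling the others often enough to correct early misestimates.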