Finding the Words to Say: Hidden State Visualizations for Language Models
Explores visualizing hidden states in Transformer language models to understand their internal decision-making process during text generation.
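A minimal sketch of the underlying idea, assuming a Hugging Face GPT-2 model as a stand-in for any Transformer LM: grab the per-layer hidden states and project each through the output head to see which tokens the model is "considering" at each layer. All names and the prompt are illustrative.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs, output_hidden_states=True)

# outputs.hidden_states is a tuple: the embedding output plus one tensor
# per layer, each of shape (batch, seq_len, hidden_dim).
for layer, h in enumerate(outputs.hidden_states):
    # Apply the final layer norm so earlier layers' states sit in the
    # space lm_head expects, then project onto the vocabulary.
    logits = model.lm_head(model.transformer.ln_f(h[0, -1]))
    top = logits.topk(3).indices
    print(f"layer {layer:2d}: {tokenizer.decode(top)}")
```

Watching the top-ranked tokens shift layer by layer is one simple way to visualize the model converging on its final word choice.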
Explores methods for controlling attributes like topic and style in neural text generation using decoding strategies, prompt design, and fine-tuning.
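Of the three levers the summary mentions, decoding strategies are the cheapest to demo. A hedged sketch, assuming GPT-2 via the `transformers` text-generation pipeline; the prompt and sampling values are illustrative, not recommendations:

```python
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
prompt = "The ocean is"

# Low temperature + tight nucleus (top-p) keeps output conservative;
# higher values trade coherence for variety.
conservative = generator(prompt, do_sample=True, top_p=0.5,
                         temperature=0.7, max_new_tokens=30)
adventurous = generator(prompt, do_sample=True, top_p=0.95,
                        temperature=1.3, max_new_tokens=30)
print(conservative[0]["generated_text"])
print(adventurous[0]["generated_text"])
```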
Explores interactive methods for interpreting transformer language models, focusing on input saliency and neuron activation analysis.
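Input saliency can be sketched with plain gradient-times-input attribution, one of the methods this kind of analysis builds on. A minimal version, assuming GPT-2; the sentence is illustrative:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

ids = tokenizer("Heathrow airport is in the city of",
                return_tensors="pt").input_ids
# Feed embeddings directly so gradients can flow back to the inputs.
embeds = model.transformer.wte(ids).detach().requires_grad_(True)
logits = model(inputs_embeds=embeds).logits

# Backprop from the model's top next-token choice to the input tokens.
target = logits[0, -1].argmax()
logits[0, -1, target].backward()
scores = (embeds.grad[0] * embeds[0]).sum(-1).abs()

for tok, s in zip(tokenizer.convert_ids_to_tokens(ids[0]), scores):
    print(f"{tok:>12s}  {s.item():.3f}")
```

Higher scores mark input tokens that most influenced the prediction.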
A technical overview of approaches for building open-domain question answering systems using pretrained language models and neural networks.
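A toy sketch of the reader half of such a system, using an off-the-shelf extractive QA model; in a full open-domain pipeline the context would come from a retrieval step rather than being hard-coded. The model choice here is an assumption:

```python
from transformers import pipeline

reader = pipeline("question-answering",
                  model="distilbert-base-cased-distilled-squad")
context = ("The Transformer architecture was introduced in the 2017 "
           "paper 'Attention Is All You Need'.")
# The reader extracts an answer span from the supplied context.
print(reader(question="When was the Transformer introduced?",
             context=context))
```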
A visual guide explaining how GPT-3 is trained and generates text, breaking down its transformer architecture and massive scale.
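The generation process the guide visualizes is a token-by-token loop. A bare-bones greedy-decoding sketch, shown with GPT-2 since GPT-3's weights aren't public:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

ids = tokenizer("GPT-3 generates text one", return_tensors="pt").input_ids
for _ in range(10):
    with torch.no_grad():
        logits = model(ids).logits
    # Greedy decoding: append the highest-probability next token and
    # feed the extended sequence back in.
    next_id = logits[0, -1].argmax().reshape(1, 1)
    ids = torch.cat([ids, next_id], dim=1)
print(tokenizer.decode(ids[0]))
```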
An analysis of OpenAI's GPT-3 language model, focusing on its 175B parameters, in-context learning capabilities, and performance on NLP tasks.
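In-context learning in miniature: the "training" is just examples placed in the prompt, with no weight updates. The few-shot format below follows the GPT-3 paper's translation example; the completion call itself is omitted since it depends on an API:

```python
# With an API-served model, one would send this prompt as-is and read
# the completion; the model infers the task from the in-prompt examples.
few_shot_prompt = """\
Translate English to French.

sea otter => loutre de mer
peppermint => menthe poivrée
cheese =>"""
print(few_shot_prompt)
```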
A technical overview of the evolution of large-scale pre-trained language models like BERT, GPT, and T5, focusing on contextual embeddings and transfer learning in NLP.
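A small sketch of what "contextual" means in practice, assuming BERT via `transformers`: the same word gets a different vector depending on its sentence, unlike static embeddings. Sentences and the helper are illustrative:

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

def embed_word(sentence, word):
    # Return the contextual vector for `word` within `sentence`.
    enc = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]
    idx = enc.input_ids[0].tolist().index(
        tokenizer.convert_tokens_to_ids(word))
    return hidden[idx]

a = embed_word("I deposited cash at the bank.", "bank")
b = embed_word("We sat on the bank of the river.", "bank")
# Similarity well below 1.0: the representation depends on context.
print(torch.cosine_similarity(a, b, dim=0).item())
```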
An analysis of a research paper's scatterplot comparing the perplexities of Kneser-Ney, hierarchical Pitman-Yor, and hierarchical Dirichlet language models.
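For reference, a perplexity number like the ones compared in that plot is the exponential of the average per-token negative log-likelihood. A sketch of the computation with a neural LM (the paper's models are count-based Bayesian LMs, but the metric is the same); the sentence is illustrative:

```python
import math
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

ids = tokenizer("the quick brown fox jumps over the lazy dog",
                return_tensors="pt").input_ids
with torch.no_grad():
    # With labels=ids, the model returns the mean cross-entropy
    # (negative log-likelihood) over next-token predictions.
    nll = model(ids, labels=ids).loss
print(f"perplexity = {math.exp(nll.item()):.1f}")
```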