Sebastian Raschka

SebastianRaschka.com is the personal blog of Sebastian Raschka, PhD, an LLM research engineer whose work bridges academia and industry in AI and machine learning. On his blog and in its notes section, he publishes in-depth, well-documented articles on topics such as large language models (LLMs), reasoning models, machine learning in Python, neural networks, data science workflows, and deep learning architectures. Recent posts cover reasoning LLMs, comparisons of modern open-weight transformer architectures, and guides to building, training, and analyzing neural networks and model internals.

https://sebastianraschka.com/

11/29/2025

AI, machine learning, Python, deep learning, LLM

Articles from this Blog

98 articles from this blog

8/17/2024 • EN

New LLM Pre-training and Post-training Paradigms

Analyzes the latest pre-training and post-training methodologies used in state-of-the-art LLMs like Qwen 2, Apple's models, Gemma 2, and Llama 3.1.

LLM, Language Models, Pre Training

7/20/2024 • EN

Instruction Pretraining LLMs

Explores recent research on instruction finetuning for LLMs, including a cost-effective method for generating synthetic training data from scratch.

LLM Pretraining, Instruction Finetuning, Synthetic Data Generation

6/2/2024 • EN

LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments?

Analysis of new LLM research on instruction masking and LoRA finetuning methods, with practical insights for developers.

Parameter Efficient Finetuning, LoRA, LLM Finetuning

6/2/2024 • EN

Developing an LLM: Building, Training, Finetuning

A 1-hour presentation on the LLM development cycle, covering architecture, training, finetuning, and evaluation methods.

Architecture, Tokenization, Finetuning

5/12/2024 • EN

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

A technical review of April 2024's major open LLM releases (Mixtral, Llama 3, Phi-3, OpenELM) and a comparison of DPO vs PPO for LLM alignment.

LLM, Reinforcement Learning, Transformer

4/20/2024 • EN

Using and Finetuning Pretrained Transformers

Explores methods for using and finetuning pretrained large language models, including feature-based approaches and parameter updates.

Machine Learning, large language models, AI

3/31/2024 • EN

Tips for LLM Pretraining and Evaluating Reward Models

Analysis of recent AI research papers on continued pretraining for LLMs and reward modeling for RLHF, with insights into model updates and alignment.

Reinforcement Learning, LLM Pretraining, Reward Modeling

3/3/2024 • EN

Research Papers in February 2024

A summary of February 2024 AI research, covering new open-source LLMs like OLMo and Gemma, and a study on small, fine-tuned models for text summarization.

open source, LLM, AI Research

2/18/2024 • EN

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

A guide to implementing LoRA and the new DoRA method for efficient model finetuning in PyTorch from scratch.

PyTorch, Finetuning, LoRA

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Strategies for improving LLM performance through dataset-centric fine-tuning, focusing on instruction datasets rather than model architecture changes.

LLM, Neural Networks, Dataset

8/10/2023 • EN

The NeurIPS 2023 LLM Efficiency Challenge Starter Guide

A guide to participating in the NeurIPS 2023 LLM Efficiency Challenge, focusing on efficient fine-tuning of large language models on a single GPU.

LLM, Neural Networks, GPU

7/1/2023 • EN

Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch

Techniques to reduce memory usage by up to 20x when training LLMs and Vision Transformers in PyTorch.

memory optimization, PyTorch, Gradient Accumulation

6/14/2023 • EN

Finetuning Falcon LLMs More Efficiently With LoRA and Adapters

A guide to efficiently finetuning Falcon LLMs using parameter-efficient methods like LoRA and Adapters to reduce compute time and cost.

Parameter Efficient Finetuning, Adapters, LoRA

5/11/2023 • EN

Accelerating Large Language Models with Mixed-Precision Techniques

Exploring mixed-precision techniques to speed up large language model training and inference by up to 3x without losing accuracy.

large language models, Deep Learning, GPU Optimization

4/26/2023 • EN

Parameter-Efficient LLM Finetuning With Low-Rank Adaptation (LoRA)

Learn about Low-Rank Adaptation (LoRA), a parameter-efficient method for finetuning large language models with reduced computational costs.

Machine Learning, Parameter Efficient Finetuning, LoRA

4/12/2023 • EN

Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to LLaMA-Adapters

A guide to parameter-efficient finetuning methods for large language models, covering techniques like prefix tuning and LLaMA-Adapters.

large language models, Parameter Efficient Finetuning, Prefix Tuning

3/28/2023 • EN

Finetuning Large Language Models On A Single GPU Using Gradient Accumulation

Guide to finetuning large language models on a single GPU using gradient accumulation to overcome memory limitations.

large language models, Transformers, Gradient Accumulation

3/23/2023 • EN

Keeping Up With AI Research And News

A guide on managing the flood of AI and machine learning research, covering tools and strategies for prioritizing papers and news.

Machine Learning, productivity, AI Research

2/23/2023 • EN

Some Techniques To Make Your PyTorch Models Train (Much) Faster

Learn techniques to speed up PyTorch model training by 8x using PyTorch Lightning without sacrificing accuracy.

performance optimization, PyTorch, PyTorch Lightning

2/9/2023 • EN

Understanding and Coding the Self-Attention Mechanism of Large Language Models From Scratch

A technical guide to coding the self-attention mechanism from scratch, as used in transformers and large language models.

Python, Neural Networks, Natural Language Processing
