Sebastian Raschka

SebastianRaschka.com is the personal blog of Sebastian Raschka, PhD, an LLM research engineer whose work bridges academia and industry in AI and machine learning. On his blog and in its notes section, he publishes in-depth, well-documented articles on large language models (LLMs), reasoning models, machine learning in Python, neural networks, data science workflows, and deep learning architectures. Recent posts cover reasoning LLMs, comparisons of modern open-weight transformer architectures, and guides to building, training, and analyzing neural networks and their internals.

https://sebastianraschka.com/

11/29/2025

ai machine learning python deep learning llm

Articles from this Blog

98 articles from this blog

2/7/2023 • EN

Understanding Large Language Models -- A Transformative Reading List

A curated reading list of key academic papers for understanding the development and architecture of large language models and transformers.

Machine Learning large language models Transformers

2/1/2023 • EN

What Are the Different Approaches for Detecting Content Generated by LLMs Such As ChatGPT? And How Do They Work and Differ?

An overview of four different methods for detecting AI-generated text, including OpenAI's AI Classifier, DetectGPT, GPTZero, and watermarking.

LLM OpenAI ChatGPT

1/29/2023 • EN

Comparing Different Automatic Image Augmentation Methods in PyTorch

A comparison of AutoAugment, RandAugment, AugMix, and TrivialAugment image augmentation methods in PyTorch for reducing overfitting.

PyTorch Image Augmentation AutoAugment

1/16/2023 • EN

Curated Resources and Trustworthy Experts: The Key Ingredients for Finding Accurate Answers to Technical Questions in the Future

Analyzes the limitations of AI chatbots like ChatGPT in providing accurate technical answers and discusses the need for curated data and human experts.

large language models ChatGPT LLM Training

1/15/2023 • EN

Training an XGBoost Classifier Using Cloud GPUs Without Worrying About Infrastructure

Learn how to train an XGBoost classifier using cloud GPUs without managing infrastructure via the Lightning AI framework.

Machine Learning Infrastructure XGBoost

1/5/2023 • EN

Open Source Highlights 2022 for Machine Learning & AI

A curated list of the top 10 open-source machine learning and AI projects released or updated in 2022, including PyTorch 2.0 and scikit-learn 1.2.

Machine Learning open source Neural Networks

1/3/2023 • EN

Influential Machine Learning Papers Of 2022

A review of the top 10 most influential machine learning papers from 2022, including ConvNeXt and MaxViT, with technical analysis.

Machine Learning computer vision Convolutional Neural Networks

10/15/2022 • EN

Ahead Of AI, And What's Next?

Announces the launch of 'Ahead of AI', a monthly newsletter covering AI trends, educational content, and personal updates on machine learning projects.

Machine Learning Neural Networks Deep Learning

7/24/2022 • EN

A Short Chronology Of Deep Learning For Tabular Data

A curated list and summary of recent research papers exploring deep learning methods specifically designed for tabular data.

Machine Learning tabular data Deep Learning

7/5/2022 • EN

No, We Don't Have to Choose Batch Sizes As Powers Of 2

Challenges the common practice of using powers of 2 for neural network batch sizes, examining the theory and practical benchmarks.

memory alignment Neural Networks Deep Learning

6/30/2022 • EN

Sharing Deep Learning Research Models with Lightning Part 2: Leveraging the Cloud

Learn how to deploy a deep learning research demo on the cloud using the Lightning framework, including GPU training and model sharing.

Deep Learning Lightning Framework Cloud Deployment

6/17/2022 • EN

Sharing Deep Learning Research Models with Lightning Part 1: Building A Super Resolution App

Learn to build a Super Resolution GAN demo using the Lightning framework in this first part of a deep learning tutorial series.

Deep Learning PyTorch Lightning Super Resolution

6/12/2022 • EN

Taking Datasets, DataLoaders, and PyTorch’s New DataPipes for a Spin

A hands-on exploration of PyTorch's new DataPipes for efficient data loading, comparing them to traditional Datasets and DataLoaders.

DataLoader Dataset PyTorch

5/18/2022 • EN

Running PyTorch on the M1 GPU

A hands-on review of PyTorch's new M1 GPU support, including installation steps and performance benchmarks for deep learning tasks.

Neural Networks Deep Learning PyTorch

4/25/2022 • EN

Creating Confidence Intervals for Machine Learning Classifiers

A guide to creating confidence intervals for evaluating machine learning models, covering multiple methods to quantify performance uncertainty.

Machine Learning statistics performance metrics

4/4/2022 • EN

Losses Learned

A guide to correctly implementing cross-entropy loss in PyTorch for binary and multiclass classification, explaining common pitfalls and best practices.

PyTorch Cross Entropy Loss Binary Classification

3/24/2022 • EN

TorchMetrics

Explains the difference between .update() and .forward() in TorchMetrics, a PyTorch library for tracking model performance during training.

metrics Deep Learning Model Evaluation

2/25/2022 • EN

Machine Learning with PyTorch and Scikit-Learn

Announces a new machine learning book covering scikit-learn, deep learning with PyTorch, neural networks, and reinforcement learning.

Machine Learning Neural Networks Deep Learning

12/29/2021 • EN

Introduction to Machine Learning

A comprehensive collection of 90 machine learning lecture videos covering Python, scikit-learn, algorithms, and model evaluation techniques.

Python Machine Learning Deep Learning

7/9/2021 • EN

Introduction to Deep Learning

A comprehensive deep learning course covering fundamentals, neural networks, computer vision, and generative models using PyTorch.

Machine Learning computer vision Neural Networks
