Sebastian Raschka

Sebastian Raschka, PhD, is an LLM Research Engineer and AI expert bridging academia and industry, specializing in large language models, high-performance AI systems, and practical, code-driven machine learning.

https://sebastianraschka.com

RSS Feed

1/5/2026

Large Language Models Artificial Intelligence Machine Learning LLM Research AI Engineering

Articles from this Blog

97 articles from this blog

2/1/2026 • EN

State of AI 2026 with Sebastian Raschka, Nathan Lambert, and Lex Fridman

A 4.5-hour interview discussing the state of AI in 2026, covering LLMs, geopolitics, training, open vs. closed models, AGI timelines, and industry implications.

ai development artificial intelligence large language models

1/24/2026 • EN

Categories of Inference-Time Scaling for Improved LLM Reasoning

An overview of inference-time scaling methods for improving LLM reasoning, categorizing techniques like chain-of-thought and self-consistency.

llm Reasoning Inference Scaling

12/30/2025 • EN

The State Of LLMs 2025: Progress, Problems, and Predictions

A 2025 year-in-review of Large Language Models, covering major developments in reasoning, architecture, costs, and predictions for 2026.

llm AI Research Reinforcement Learning

12/30/2025 • EN

LLM Research Papers: The 2025 List (July to December)

A curated list of notable LLM (Large Language Model) research papers published from July to December 2025, categorized by topic.

Machine Learning artificial intelligence llm

12/8/2025 • EN

From Random Forests to RLVR: A Short History of ML/AI Hello Worlds

A historical overview of beginner-friendly 'Hello World' examples in machine learning and AI, from 2013's Random Forests to 2025's Qwen3 with RLVR.

Machine Learning artificial intelligence Neural Networks

12/3/2025 • EN

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

A technical analysis of DeepSeek V3.2's architecture, sparse attention, and reinforcement learning updates, comparing it to other flagship AI models.

Reinforcement Learning Deepseek LLM Architecture

11/12/2025 • EN

Recommendations for Getting the Most Out of a Technical Book

Author shares a structured method for reading technical books effectively, focusing on understanding concepts and practical coding.

programming education Code Execution LLM Development

11/4/2025 • EN

Beyond Standard LLMs

An overview of alternative LLM architectures beyond standard transformers, including linear attention hybrids, text diffusion models, and world models.

Autoregressive Models LLM Architectures Transformer Alternatives

10/29/2025 • EN

DGX Spark and Mac Mini for Local PyTorch Development

A technical comparison of the DGX Spark and Mac Mini M4 Pro for local PyTorch development and LLM inference, including benchmarks.

llm benchmark local development

10/5/2025 • EN

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

Explores four main methods for evaluating Large Language Models (LLMs), including code examples for implementing each approach from scratch.

benchmarking LLM Evaluation Model Comparison

9/6/2025 • EN

Understanding and Implementing Qwen3 From Scratch

A hands-on tutorial implementing the Qwen3 large language model architecture from scratch using pure PyTorch, explaining its core components.

llm Pytorch Transformer

8/9/2025 • EN

From GPT-2 to gpt-oss: Analyzing the Architectural Advances

Analyzes the architectural advancements in OpenAI's new open-weight gpt-oss models, comparing them to GPT-2 and other modern LLMs.

llm Openai Transformer

7/19/2025 • EN

The Big LLM Architecture Comparison

A technical comparison of architectural changes in major Large Language Models (LLMs) from 2024-2025, focusing on structural innovations beyond benchmarks.

Mixture Of Experts LLM Architecture Transformer Models

7/1/2025 • EN

LLM Research Papers: The 2025 List (January to June)

A curated list of key LLM research papers from the first half of 2025, organized by topic such as reasoning models and reinforcement learning.

Machine Learning artificial intelligence Reinforcement Learning

6/17/2025 • EN

Understanding and Coding the KV Cache in LLMs from Scratch

A technical tutorial explaining the concept and implementation of KV caches for efficient inference in Large Language Models (LLMs).

Attention Mechanism Kv Cache LLM Inference

5/10/2025 • EN

Coding LLMs from the Ground Up: A Complete Course

A course teaching how to code Large Language Models from scratch to deeply understand their inner workings, with practical video tutorials.

Machine Learning llm Neural Networks

4/19/2025 • EN

The State of Reinforcement Learning for LLM Reasoning

Explores the latest developments in using reinforcement learning to improve reasoning capabilities in large language models (LLMs).

Openai Reinforcement Learning Model Training

3/29/2025 • EN

First Look at Reasoning From Scratch: Chapter 1

An introduction to reasoning in Large Language Models, covering key concepts like chain-of-thought and methods to improve LLM reasoning abilities.

Machine Learning artificial intelligence llm

3/8/2025 • EN

Inference-Time Compute Scaling Methods to Improve Reasoning Models

Explores recent research on improving LLM reasoning through inference-time compute scaling methods, comparing various techniques and their impact.

large language models AI Research LLM Reasoning

2/5/2025 • EN

Understanding Reasoning LLMs

Explores four main approaches to building and enhancing reasoning capabilities in Large Language Models (LLMs) for complex tasks.

Reinforcement Learning Deepseek LLM Reasoning

1 2 3 4 5 Next

Sebastian Raschka

Articles from this Blog

Select Language