Model Training articles

12/7/2025 • EN

Agents, Context, and the Real Work of AI Development

A developer reflects on AI agent architectures, context management, and the industry's overemphasis on model development vs. building applications.

Agent Architectures ai development Context Windows Model Training Reinforcement Learning

Mark Tinderholt

12/4/2025 • EN

#AI horizons 25-11 – Kimi K2 Thinking and the New AI Balance of Power

Analysis of China's Kimi K2 Thinking AI model, a low-cost, open-weight model challenging US dominance in reasoning and agentic tasks.

artificial intelligence Benchmarks Model Training Open Weight Models Reasoning Models

Daniele Grandini

12/2/2025 • EN

Claude 4.5 Opus' Soul Document

Anthropic's internal 'soul document' used to train Claude 4.5 Opus's personality and values has been confirmed and partially revealed.

AI Safety Anthropic Claude llm Model Training

Simon Willison

9/5/2025 • EN

In Defense of AI Evals, for Everyone

A defense of systematic AI evaluation (evals) in development, arguing they are essential for measuring application quality and improving models.

AI Evaluation Machine Learning Model Training Quality Assurance software development

Shreya Shankar

7/14/2025 • EN

Why your AI might be biased (and what you can do about it)

Explains the causes of bias in AI systems, focusing on training data and proxy variables, and offers practical steps for developers to mitigate it.

ai bias Algorithmic Fairness Data Quality Machine Learning Ethics Model Training

Leo Visser

4/19/2025 • EN

The State of Reinforcement Learning for LLM Reasoning

Explores the latest developments in using reinforcement learning to improve reasoning capabilities in large language models (LLMs).

LLM Reasoning Model Training Openai Ppo Reinforcement Learning

Sebastian Raschka

4/19/2025 • EN

The State of Reinforcement Learning for LLM Reasoning

Analyzes the use of reinforcement learning to enhance reasoning capabilities in large language models (LLMs) like GPT-4.5 and o3.

LLM Reasoning Model Training Ppo Reinforcement Learning Rlhf

Sebastian Raschka

6/29/2024 • EN

Understanding Input Masking in LLM Finetuning

Explains the concept and purpose of input masking in LLM fine-tuning, using a practical example with Axolotl for a code PR classification task.

Axolotl Input Masking LLM Finetuning Model Training Training Data

Saeed Esmaili

7/1/2023 • EN

Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch

A guide to 9 PyTorch techniques for drastically reducing memory usage when training vision transformers and LLMs, enabling training on consumer hardware.

Deep Learning memory optimization Model Training Pytorch Transformers

Sebastian Raschka

2/23/2023 • EN

Some Techniques To Make Your PyTorch Models Train (Much) Faster

Techniques to accelerate PyTorch model training by 8x using PyTorch Lightning, with a DistilBERT fine-tuning example.

Lightning Model Training performance optimization Pytorch Transformer

Sebastian Raschka

2/23/2023 • EN

Some Techniques To Make Your PyTorch Models Train (Much) Faster

Learn techniques to speed up PyTorch model training by 8x using PyTorch Lightning, maintaining accuracy while reducing training time.

Model Training performance optimization Pytorch Pytorch Lightning Transformer

Sebastian Raschka

6/30/2022 • EN

Sharing Deep Learning Research Models with Lightning Part 2: Leveraging the Cloud

Learn how to deploy a deep learning research demo on the cloud using the Lightning framework, including GPU training and model sharing.

Cloud Deployment Deep Learning Gpu Resources Lightning Framework Model Training

Sebastian Raschka

3/22/2022 • EN

Save up to 90% training cost with AWS Spot Instances and Hugging Face Transformers

A technical guide on using AWS Spot Instances with Hugging Face Transformers on Amazon SageMaker to reduce machine learning training costs by up to 90%.

Amazon Sagemaker AWS Spot Instances Hugging Face Transformers Managed Spot Training Model Training

Philipp Schmid

6/10/2018 • EN

Simple Machine Learning with .NET Core Sample

A tutorial on implementing a binary classification machine learning model using ML.NET in .NET Core to predict Titanic passenger survival.

Binary Classification Machine Learning Mlnet Model Training Net Core

Carlos Mendible

Model Training Articles

Agents, Context, and the Real Work of AI Development

#AI horizons 25-11 – Kimi K2 Thinking and the New AI Balance of Power

Claude 4.5 Opus' Soul Document

In Defense of AI Evals, for Everyone

Why your AI might be biased (and what you can do about it)

The State of Reinforcement Learning for LLM Reasoning

The State of Reinforcement Learning for LLM Reasoning

Understanding Input Masking in LLM Finetuning

Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch

Some Techniques To Make Your PyTorch Models Train (Much) Faster

Some Techniques To Make Your PyTorch Models Train (Much) Faster

Sharing Deep Learning Research Models with Lightning Part 2: Leveraging the Cloud

Save up to 90% training cost with AWS Spot Instances and Hugging Face Transformers

Simple Machine Learning with .NET Core Sample

Select Language