Finetuning articles

2/1/2025 • EN

Finetune Granite3.1 for Reasoning

A technical guide on fine-tuning IBM's Granite3.1 AI model using Guided Reward Policy Optimization (GRPO) to enhance its reasoning capabilities.

Finetuning Granite31 Grpo Reasoning Reinforcement Learning

Ruslan Magana Vsevolodovna

1/26/2025 • EN

Notes on ‘AI Engineering’ (Chip Huyen) chapter 7: Finetuning

A summary of Chip Huyen's chapter on AI fine-tuning, arguing it's a last resort after prompt engineering and RAG, detailing its technical and organizational complexities.

AI Engineering Finetuning Lora Peft Rag

Alex Strick van Linschoten

9/21/2024 • EN

Building A GPT-Style LLM Classifier From Scratch

A guide to transforming pretrained LLMs into text classifiers, with insights from the author's new book on building LLMs from scratch.

classification Finetuning Gpt llm Text Classification

Sebastian Raschka

6/2/2024 • EN

Developing an LLM: Building, Training, Finetuning

A 1-hour video presentation covering the full development cycle of Large Language Models, from architecture and pretraining to finetuning and evaluation.

Finetuning LLM Development Pretraining Tokenization Training

Sebastian Raschka

6/2/2024 • EN

Developing an LLM: Building, Training, Finetuning

A 1-hour presentation on the LLM development cycle, covering architecture, training, finetuning, and evaluation methods.

Architecture Finetuning LLM Development Tokenization Training

Sebastian Raschka

6/2/2024 • EN

LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments?

Explores new research on instruction masking and LoRA finetuning techniques for improving large language models (LLMs).

Finetuning Instruction Tuning llm Lora Research

Sebastian Raschka

4/20/2024 • EN

Using and Finetuning Pretrained Transformers

Explores methods for using and finetuning pretrained large language models, including feature-based approaches and parameter updates.

ai Finetuning large language models Machine Learning Transformers

Sebastian Raschka

3/3/2024 • EN

Research Papers in February 2024

A summary of key AI research papers from February 2024, focusing on new open-source LLMs, small fine-tuned models, and efficient fine-tuning techniques.

AI Research Finetuning Gemma llm open source

Sebastian Raschka

3/3/2024 • EN

Research Papers in February 2024

A summary of February 2024 AI research, covering new open-source LLMs like OLMo and Gemma, and a study on small, fine-tuned models for text summarization.

AI Research Finetuning llm open source Summarization

Sebastian Raschka

2/18/2024 • EN

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

A technical guide implementing DoRA, a new low-rank adaptation method for efficient model finetuning, from scratch in PyTorch.

Dora Finetuning Lora Low Rank Adaptation Pytorch

Sebastian Raschka

2/18/2024 • EN

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

A guide to implementing LoRA and the new DoRA method for efficient model finetuning in PyTorch from scratch.

Dora Finetuning Lora Low Rank Adaptation Pytorch

Sebastian Raschka

2/11/2024 • EN

How to Generate and Use Synthetic Data for Finetuning

Explores methods for generating synthetic data (distillation & self-improvement) to fine-tune LLMs for pretraining, instruction-tuning, and preference-tuning.

Finetuning Instruction Tuning llm Preference Tuning Synthetic Data

Eugene Yan

11/5/2023 • EN

Out-of-Domain Finetuning to Bootstrap Hallucination Detection

Explores using out-of-domain data to improve LLM finetuning for detecting factual inconsistencies (hallucinations) in text summaries.

Finetuning Hallucination Detection llm Machine Learning Natural Language Inference

Eugene Yan

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Explores dataset-centric strategies for fine-tuning LLMs, focusing on instruction datasets to improve model performance without altering architecture.

Dataset Finetuning Instruction Tuning llm Neural Networks

Sebastian Raschka

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Strategies for improving LLM performance through dataset-centric fine-tuning, focusing on instruction datasets rather than model architecture changes.

Dataset Finetuning Instruction Tuning llm Neural Networks

Sebastian Raschka

8/10/2023 • EN

The NeurIPS 2023 LLM Efficiency Challenge Starter Guide

A guide to participating in the NeurIPS 2023 LLM Efficiency Challenge, focusing on efficient fine-tuning of large language models on a single GPU.

Efficient Training Finetuning Gpu llm Neural Networks

Sebastian Raschka

8/10/2023 • EN

The NeurIPS 2023 LLM Efficiency Challenge Starter Guide

A guide to participating in the NeurIPS 2023 LLM Efficiency Challenge, covering setup, rules, and strategies for efficient LLM fine-tuning on limited hardware.

Finetuning Gpu Optimization LLM Efficiency Neural Networks Neurips

Sebastian Raschka