Fine Tuning articles

10/5/2025 • EN

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

Explores four main methods for evaluating Large Language Models (LLMs), including code examples for implementing each approach from scratch.

benchmarking Fine Tuning LLM Evaluation Model Comparison Reasoning Models

Sebastian Raschka

10/5/2025 • EN

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

A guide to the four main methods for evaluating Large Language Models, including code examples and practical implementation details.

benchmarking Fine Tuning LLM Evaluation Model Comparison Reasoning Models

Sebastian Raschka

4/7/2025 • EN

A Journey from AI to LLMs and MCP - 3 - Boosting LLM Performance — Fine-Tuning, Prompt Engineering, and RAG

Explores three key methods to enhance LLM performance: fine-tuning, prompt engineering, and RAG, detailing their use cases and trade-offs.

ai Fine Tuning llm prompt engineering Retrieval Augmented Generation

Alex Merced

1/23/2025 • EN

Noteworthy LLM Research Papers of 2024

A curated list of 12 influential LLM research papers from 2024, highlighting key advancements in AI and machine learning.

Fine Tuning llm Lora Mixture Of Experts Research Papers

Sebastian Raschka

12/25/2024 • EN

Fine-tune classifier with ModernBERT in 2025

A tutorial on fine-tuning the ModernBERT model for classification tasks to build an efficient LLM router, covering setup, training, and evaluation.

Bert classification Fine Tuning LLM Routing Modernbert

Philipp Schmid

9/30/2024 • EN

How to Fine-Tune Multimodal Models or VLMs with Hugging Face TRL

A technical guide on fine-tuning Vision-Language Models (VLMs) using Hugging Face's TRL library for custom applications like image-to-text generation.

Fine Tuning Hugging Face Multimodal Models Trl Vision Language Models

Philipp Schmid

8/17/2024 • EN

New LLM Pre-training and Post-training Paradigms

Analyzes the latest pre-training and post-training methodologies used in state-of-the-art LLMs like Qwen 2, Apple's models, Gemma 2, and Llama 3.1.

Fine Tuning Language Models llm Post Training Pre Training

Sebastian Raschka

7/7/2024 • EN

Extrinsic Hallucinations in LLMs

Explores the causes and types of hallucinations in large language models, focusing on extrinsic hallucinations and how training data affects factual accuracy.

Factuality Fine Tuning Hallucination llm Pre Training

Lilian Weng

6/11/2024 • EN

Fine-tune Llama 3 with PyTorch FSDP and Q-Lora on Amazon SageMaker

A technical guide on fine-tuning the Llama 3 LLM using PyTorch FSDP and Q-Lora on Amazon SageMaker for efficient training.

Amazon Sagemaker Fine Tuning Llama 3 Pytorch Fsdp Q Lora

Philipp Schmid

6/4/2024 • EN

Fine-tune Embedding models for Retrieval Augmented Generation (RAG)

A guide to fine-tuning embedding models for RAG applications using Sentence Transformers 3, featuring Matryoshka Representation Learning for efficiency.

Embedding Models Fine Tuning Matryoshka Representation Learning Rag Sentence Transformers

Philipp Schmid

4/22/2024 • EN

Efficiently fine-tune Llama 3 with PyTorch FSDP and Q-Lora

A technical guide on fine-tuning the Llama 3 70B model using PyTorch FSDP and Q-Lora for efficient training on limited GPU hardware.

Fine Tuning large language models Llama 3 Pytorch Fsdp Q Lora

Philipp Schmid

8/13/2023 • EN

How to Match LLM Patterns to Problems

A guide to selecting the right LLM architectural patterns (like RAG, fine-tuning, caching) to solve common production challenges such as performance metrics and data constraints.

Fine Tuning LLM Applications LLM Patterns LLM Production Rag

Eugene Yan

7/30/2023 • EN

Patterns for Building LLM-based Systems & Products

A practical guide outlining seven key patterns for integrating Large Language Models (LLMs) into robust, production-ready systems and products.

caching Fine Tuning Guardrails llm Rag

Eugene Yan

5/23/2023 • EN

Generative AI for Document Understanding with Hugging Face and Amazon SageMaker

Tutorial on fine-tuning and deploying the Donut model for OCR-free document understanding using Hugging Face and Amazon SageMaker.

Amazon Sagemaker Document Understanding Fine Tuning generative ai Hugging Face

Philipp Schmid

2/22/2023 • EN

Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL

Guide to fine-tuning the large FLAN-T5 XXL model using Amazon SageMaker managed training and DeepSpeed for optimization.

Amazon Sagemaker Deepspeed Fine Tuning Flan T5 large language models

Philipp Schmid

10/13/2022 • EN

Fine-tuning LayoutLM for document-understanding using Keras and Hugging Face Transformers

A tutorial on fine-tuning Microsoft's LayoutLM model for document understanding using TensorFlow, Keras, and the FUNSD dataset.

Document Understanding Fine Tuning Hugging Face Transformers Kera Layoutlm

Philipp Schmid

12/7/2021 • EN

Hugging Face Transformers BERT fine-tuning using Amazon SageMaker and Training Compiler

Guide to fine-tuning a Hugging Face BERT model for text classification using Amazon SageMaker and the new Training Compiler to accelerate training.

Amazon Sagemaker Bert Fine Tuning Hugging Face Transformers

Philipp Schmid

12/5/2021 • EN

Learning with not Enough Data Part 1: Semi-Supervised Learning

Explores semi-supervised learning techniques for training models when labeled data is scarce, focusing on combining labeled and unlabeled data.

Data Scarcity Fine Tuning Machine Learning Pre Training Semi Supervised Learning

Lilian Weng

11/28/2019 • EN

The Accessibility of GPT-2 - Text Generation and Fine-tuning

A tutorial on using HuggingFace's API to access and fine-tune OpenAI's GPT-2 model for text generation.

Fine Tuning Gpt 2 NLP Text Generation Transformers

Yoel Zeldes

Fine Tuning Articles

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

A Journey from AI to LLMs and MCP - 3 - Boosting LLM Performance — Fine-Tuning, Prompt Engineering, and RAG

Noteworthy LLM Research Papers of 2024

Fine-tune classifier with ModernBERT in 2025

How to Fine-Tune Multimodal Models or VLMs with Hugging Face TRL

New LLM Pre-training and Post-training Paradigms

Extrinsic Hallucinations in LLMs

Fine-tune Llama 3 with PyTorch FSDP and Q-Lora on Amazon SageMaker

Fine-tune Embedding models for Retrieval Augmented Generation (RAG)

Efficiently fine-tune Llama 3 with PyTorch FSDP and Q-Lora

How to Match LLM Patterns to Problems

Patterns for Building LLM-based Systems & Products

Generative AI for Document Understanding with Hugging Face and Amazon SageMaker

Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL

Fine-tuning LayoutLM for document-understanding using Keras and Hugging Face Transformers

Hugging Face Transformers BERT fine-tuning using Amazon SageMaker and Training Compiler

Learning with not Enough Data Part 1: Semi-Supervised Learning

The Accessibility of GPT-2 - Text Generation and Fine-tuning

Select Language