How to fine-tune Google Gemma with ChatML and Hugging Face TRL
This article provides a step-by-step tutorial for fine-tuning Google's Gemma open language models (2B and 7B parameter versions) using the ChatML format and the Hugging Face TRL (Transformer Reinforcement Learning) library. It covers setting up the development environment, preparing datasets, using the SFTTrainer, and running the process on consumer-grade GPUs such as the RTX 4090.
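The sketch below illustrates the kind of workflow the article walks through: loading Gemma in 4-bit so it fits on a single consumer GPU, switching the tokenizer to the ChatML template via TRL's setup_chat_format, and training with SFTTrainer and a LoRA adapter. It is an assumption-laden outline rather than the article's exact code: the dataset name is a placeholder, the hyperparameters are illustrative, and the keyword arguments reflect the TRL releases current around the article's publication (newer TRL versions move max_seq_length and dataset_text_field into SFTConfig).

```python
# Minimal sketch, not the article's verbatim code. Assumes TRL's SFTTrainer
# plus a bitsandbytes/peft QLoRA setup; dataset name and hyperparameters are
# placeholders.
import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          BitsAndBytesConfig, TrainingArguments)
from trl import SFTTrainer, setup_chat_format

model_id = "google/gemma-7b"  # or "google/gemma-2b" for the smaller model

# Load Gemma in 4-bit so it fits on a 24 GB consumer GPU such as the RTX 4090.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_compute_dtype=torch.bfloat16,
    ),
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Add the ChatML special tokens and chat template to the model and tokenizer.
model, tokenizer = setup_chat_format(model, tokenizer)

# Placeholder dataset: any dataset with a "messages" column in the
# OpenAI/ChatML schema works; render each conversation to plain text.
dataset = load_dataset("your-org/your-chat-dataset", split="train")
dataset = dataset.map(
    lambda ex: {"text": tokenizer.apply_chat_template(ex["messages"], tokenize=False)}
)

# Train a small LoRA adapter instead of the full model to save memory.
peft_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules="all-linear",
    task_type="CAUSAL_LM",
)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",
    max_seq_length=1024,
    peft_config=peft_config,
    args=TrainingArguments(
        output_dir="gemma-7b-chatml-sft",
        num_train_epochs=3,
        per_device_train_batch_size=1,
        gradient_accumulation_steps=4,
        learning_rate=2e-4,
        bf16=True,
        logging_steps=10,
    ),
)
trainer.train()
```

After training, the LoRA adapter in the output directory can be merged back into the base model or loaded alongside it for inference; the gradient-accumulation and 4-bit settings are what keep peak memory within a single RTX 4090.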