Philipp Schmid 2/22/2023

Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL

Read Original

This technical tutorial explains how to integrate DeepSpeed into Amazon SageMaker to fine-tune the large language model FLAN-T5 XXL. It covers using model parallelism, multiple GPUs, and DeepSpeed ZeRO on SageMaker to manage infrastructure and efficiently train billion-parameter models, including dataset preparation and script configuration.

Combine Amazon SageMaker and DeepSpeed to fine-tune FLAN-T5 XXL

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser