7/13/2022
•
EN
Optimizing Transformers for GPUs with Optimum
Learn to optimize Hugging Face Transformers models for GPU inference using Optimum and ONNX Runtime to reduce latency.