Big GPUs don't need big PCs
Testing GPU performance on a Raspberry Pi 5 versus a desktop PC for transcoding, AI, and multi-GPU tasks, showing surprising efficiency.
Explains the multi-layered architecture of production generative AI systems, covering hardware, models, orchestration, and tooling.
Compares DGX Spark and Mac Mini for local PyTorch development, focusing on LLM inference and fine-tuning performance benchmarks.
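To give a feel for what such a comparison measures, here is a minimal sketch of a PyTorch device micro-benchmark; the device selection logic, matrix size, and iteration counts are illustrative assumptions, not the article's benchmark code.

```python
# Rough sketch of a PyTorch device micro-benchmark (illustrative assumptions only).
import time
import torch

if torch.cuda.is_available():
    device = "cuda"
elif torch.backends.mps.is_available():
    device = "mps"
else:
    device = "cpu"

def sync() -> None:
    # GPU kernel launches are asynchronous; synchronize before reading the clock.
    if device == "cuda":
        torch.cuda.synchronize()
    elif device == "mps":
        torch.mps.synchronize()

x = torch.randn(4096, 4096, device=device)
for _ in range(3):  # warm-up iterations
    x @ x
sync()

start = time.perf_counter()
for _ in range(20):
    x @ x
sync()
print(f"{device}: {(time.perf_counter() - start) / 20 * 1e3:.1f} ms per 4096x4096 matmul")
```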
Benchmarking GPU vs CPU performance for local AI image generation in C# using the TransformersSharp library and Hugging Face models.
A technical guide on deploying DeepSeek's open reasoning AI models on Google Kubernetes Engine (GKE) using vLLM and a Gradio interface.
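For context, the serving layer in that kind of deployment boils down to a vLLM call like the sketch below; the checkpoint name (deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) and sampling settings are assumptions for illustration, not details taken from the guide, and the GKE and Gradio pieces are omitted.

```python
# Minimal vLLM sketch for serving a DeepSeek reasoning model on a GPU node.
# Checkpoint name and sampling parameters are placeholders, not the guide's values.
from vllm import LLM, SamplingParams

llm = LLM(model="deepseek-ai/DeepSeek-R1-Distill-Qwen-7B")
params = SamplingParams(temperature=0.6, max_tokens=512)

outputs = llm.generate(["Explain what a Kubernetes Deployment is."], params)
print(outputs[0].outputs[0].text)
```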
Analyzes the limitations of using GPU manufacturer TDP for estimating AI workload energy consumption, highlighting real-world measurement challenges.
A guide to running Python code on serverless GPU instances using Modal.com for faster machine learning inference, demonstrated with a speech-to-text example.
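A serverless GPU function on Modal looks roughly like the following sketch; the GPU type, container image contents, and Whisper model size are illustrative assumptions rather than the article's exact setup.

```python
# Hypothetical Modal sketch: run a speech-to-text function on a serverless GPU.
# GPU type, image packages, and model choice are assumptions for illustration.
import modal

app = modal.App("speech-to-text-demo")
image = modal.Image.debian_slim().apt_install("ffmpeg").pip_install("openai-whisper", "torch")

@app.function(gpu="A10G", image=image, timeout=600)
def transcribe(audio_bytes: bytes) -> str:
    import tempfile
    import whisper  # available inside the container image

    model = whisper.load_model("base")
    with tempfile.NamedTemporaryFile(suffix=".wav") as f:
        f.write(audio_bytes)
        f.flush()
        return model.transcribe(f.name)["text"]

@app.local_entrypoint()
def main():
    with open("sample.wav", "rb") as f:
        print(transcribe.remote(f.read()))
```

Running `modal run script.py` provisions the GPU container on demand and tears it down when the function returns.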
A quick guide to finding the NVIDIA GPU driver version running on a Google Kubernetes Engine (GKE) cluster using a kubectl command.
Exploring how Java code can be executed on GPUs for high-performance computing and machine learning, covering challenges and potential APIs.
A guide to participating in the NeurIPS 2023 LLM Efficiency Challenge, focusing on efficient fine-tuning of large language models on a single GPU.
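Single-GPU fine-tuning in that setting typically means parameter-efficient methods such as LoRA; the sketch below shows the general shape using Hugging Face `peft`, with the base model and LoRA hyperparameters chosen as placeholders rather than challenge-specific values.

```python
# Illustrative LoRA setup for single-GPU fine-tuning (model and hyperparameters are assumptions).
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder base model
    torch_dtype=torch.float16,
    device_map="auto",
)
config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only a small fraction of weights are trainable
```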
Introduces TornadoVM, an open-source framework for running Java programs on GPUs and FPGAs to boost performance without low-level code.
A technical guide on oversubscribing GPUs in Kubernetes using time slicing for development and light workloads, with setup instructions.
A guide to training XGBoost models on cloud GPUs using the Lightning AI framework, bypassing complex infrastructure setup.
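The GPU-training half of that workflow reduces to something like the sketch below (XGBoost 2.x's `device="cuda"` is assumed, the dataset is synthetic, and the Lightning AI provisioning around it is omitted).

```python
# Illustrative GPU-accelerated XGBoost training run.
# Assumes XGBoost >= 2.0 and a visible CUDA device; data is synthetic.
import xgboost as xgb
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=100_000, n_features=50, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

clf = xgb.XGBClassifier(
    n_estimators=300,
    tree_method="hist",
    device="cuda",  # train the histogram algorithm on the GPU
)
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```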
A guide to using RAPIDS to accelerate ETL and data processing workflows within a Kubeflow environment by leveraging GPUs.
Learn to optimize BERT and RoBERTa models for faster GPU inference using DeepSpeed-Inference, reducing latency from 30ms to 10ms.
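A minimal DeepSpeed-Inference setup for a BERT-style classifier looks roughly like the sketch below; the model name, dtype, and kernel-injection flag are assumptions, and the 30ms-to-10ms figure comes from the article, not from this snippet.

```python
# Illustrative DeepSpeed-Inference sketch for a BERT-style classifier.
# Model name and settings are assumptions; measure latency on your own hardware.
import torch
import deepspeed
from transformers import AutoModelForSequenceClassification, AutoTokenizer

name = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(name)

# Replace supported modules with DeepSpeed's fused inference kernels.
ds_model = deepspeed.init_inference(
    model,
    dtype=torch.float16,
    replace_with_kernel_inject=True,
)

inputs = tokenizer("DeepSpeed makes inference faster.", return_tensors="pt").to("cuda")
with torch.no_grad():
    logits = ds_model(**inputs).logits
print(logits)
```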
Learn two methods to check NVIDIA driver and CUDA versions on Kubernetes nodes using node labels or running nvidia-smi in a pod.
A guide to setting up Kubeflow for MLOps with GPU support in a local kind Kubernetes cluster for development and testing.
A guide to hacking GPU support into the kind Kubernetes tool for local development and testing with NVIDIA hardware.
A step-by-step guide to building the pyarrow Python library with CUDA support using Docker on Ubuntu for GPU data processing.
An overview of Google Colab, a free cloud-based Jupyter notebook service with GPU/TPU access for machine learning and data science.
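A quick way to confirm which accelerator a Colab runtime has attached (assuming a PyTorch runtime) is a check like the following sketch.

```python
# Check whether the Colab runtime has a CUDA GPU and, if so, which one.
import torch

if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
else:
    print("No CUDA GPU attached to this runtime.")
```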