Convert Transformers to ONNX with Hugging Face Optimum
A guide on converting Hugging Face Transformers models to the ONNX format using the Optimum library for optimized deployment.
A hands-on review of PyTorch's new official GPU support for Apple's M1 chips, covering installation steps and performance benchmarks for deep learning tasks.
Explains cross-entropy loss in PyTorch for binary and multiclass classification, highlighting common implementation pitfalls and best practices.
Announcing a new machine learning book covering fundamentals with scikit-learn and deep learning with PyTorch, including neural networks from scratch and reinforcement learning.
EvoJAX is a hardware-accelerated neuroevolution toolkit built on JAX for running parallel evolution experiments on TPUs/GPUs.
Introduces permutation-invariant neural networks for RL agents, enabling robustness to shuffled, noisy, or incomplete sensory inputs.
Explores parallelism techniques and memory optimization strategies for training massive neural networks across multiple GPUs.
An in-depth technical explanation of diffusion models, a class of generative AI models that create data by reversing a noise-adding process.
A comprehensive deep learning course with PyTorch tutorials, covering fundamentals, neural networks, and advanced topics such as CNNs, computer vision, and generative models like GANs.
A detailed review of the book 'Deep Learning with PyTorch', covering its structure, content, and suitability for students, beginners, and practitioners.
Explores interactive methods for interpreting Transformer language models, including visualizing hidden states, input saliency, and neuron activation analysis during text generation.
Explains the Neural Tangent Kernel concept through simple 1D regression examples to illustrate how neural networks evolve during training.
A technical overview of approaches for building open-domain question answering systems using pretrained language models and neural networks.
Explores whether deep learning creates a new kind of program, using the philosophy of operationalism to compare it with traditional programming.
A chronological survey of key NLP models and techniques for supervised learning, from early RNNs to modern transformers like BERT and T5.