Optimizing LLMs From a Dataset Perspective
Explores dataset-centric strategies for fine-tuning LLMs, focusing on instruction datasets to improve model performance without altering architecture.
A guide to participating in the NeurIPS 2023 LLM Efficiency Challenge, covering setup, rules, and strategies for efficiently fine-tuning large language models on a single GPU.
Analyzes Geoffrey Hinton's technical argument comparing biological and digital intelligence, concluding digital AI will surpass human capabilities.
An introduction to artificial neural networks, explaining the perceptron as the simplest building block and its ability to learn basic logical functions.
Introducing Linear Diffusion, a novel diffusion model built entirely from linear components for generating simple images like MNIST digits.
Argues against the 'lossy compression' analogy for LLMs like ChatGPT, proposing instead that they are simulators creating temporary simulacra.
A technical guide to coding the self-attention mechanism from scratch, as used in transformers and large language models.
Argues that AI image generation won't replace human artists, using information theory to explain their unique creative value.
A technical guide to implementing a GPT model from scratch using only 60 lines of NumPy code, including loading pre-trained GPT-2 weights.
An updated, comprehensive overview of the Transformer architecture and its many recent improvements, including detailed notation and attention mechanisms.
A curated list of the top 10 open-source machine learning and AI projects released or updated in 2022, including PyTorch 2.0 and scikit-learn 1.2.
Compares autoencoders and diffusion models, explaining their architectures, learning paradigms, and key differences in deep learning.
A technical explanation of the attention mechanism in transformers, building intuition from key-value lookups to the scaled dot product equation.
Author announces the launch of 'Ahead of AI', a monthly newsletter covering AI trends, educational content, and personal updates on machine learning projects.
A deep dive into the Neural Tangent Kernel (NTK) theory, explaining the math behind why wide neural networks converge during gradient descent training.
A roboticist argues for scaling robotics research like generative AI, focusing on data quality and iteration over algorithms for better generalization.
Challenges the common practice of using powers of 2 for neural network batch sizes, questioning its necessity with theoretical insights and practical benchmarks.