Deep Learning articles

2/4/2026 • EN

As Rocks May Think

A developer explores using AI coding agents like Claude to automate software development, research, and experimentation, focusing on implementing AlphaGo from scratch.

ai programming Alphago Claude Code Coding Agents Deep Learning

Eric Jang

1/31/2026 • EN

Deep Learning is Powerful Because It Makes Hard Things Easy - Reflections 10 Years On

A reflection on a decade-old blog post about deep learning, examining past predictions on architecture, scaling, and the field's evolution.

Deep Learning Machine Learning Neural Networks Scaling Laws Transformers

Ferenc Huszár

1/22/2026 • EN

Qwen3-TTS Family is Now Open Sourced: Voice Design, Clone, and Generation

Qwen3-TTS, a family of advanced multilingual text-to-speech models, is now open source, featuring voice cloning and description-based control.

Deep Learning Multilingual Models open source Text To Speech Voice Cloning

Simon Willison

1/21/2026 • EN

Leaving 1X

A robotics engineer reflects on leaving humanoid robotics company 1X, discussing the company's growth and the 'magical objects' driving AI progress.

ai Autonomy Deep Learning Robotics startup

Eric Jang

1/11/2026 • EN

You Only Look Once: 8 Years of Food Detection Evolution

A technical comparison of YOLO-based food detection from 2018 to 2026, showing the evolution of deep learning tooling and ease of use.

computer vision Deep Learning Machine Learning Object Detection Yolo

Benny Cheung

12/30/2025 • EN

LLM Research Papers: The 2025 List (July to December)

A curated list of notable LLM (Large Language Model) research papers published from July to December 2025, categorized by topic.

artificial intelligence Deep Learning llm Machine Learning Research Papers

Sebastian Raschka

12/15/2025 • EN

Elephant(s) in the room: Graph neural networks, embeddings, and foundation models in spatial data science

Explores the application of Graph Neural Networks, embeddings, and foundation models to spatial data science, with practical examples in R.

Deep Learning Embeddings Foundation Models Graph Neural Networks Spatial Data Science

Jakub Nowosad

5/10/2025 • EN

Coding LLMs from the Ground Up: A Complete Course

A course teaching how to code Large Language Models (LLMs) from scratch to deeply understand their inner workings and fundamentals.

Deep Learning llm Machine Learning Neural Networks Python

Sebastian Raschka

5/10/2025 • EN

Coding LLMs from the Ground Up: A Complete Course

A course teaching how to code Large Language Models from scratch to deeply understand their inner workings, with practical video tutorials.

Deep Learning From Scratch llm Machine Learning Neural Networks

Sebastian Raschka

5/4/2025 • EN

AI Terminology Explained – Know What You’re Talking About

Explains key AI terminology like AI, ML, deep learning, and LLMs to help engineers use the correct terms.

artificial intelligence Deep Learning llm Machine Learning Terminology

Florian Dedov

4/5/2025 • EN

A Journey from AI to LLMs and MCP - 1 - What Is AI and How It Evolved Into LLMs

Explores the evolution of AI from symbolic systems to modern Large Language Models (LLMs), detailing their capabilities and limitations.

artificial intelligence Deep Learning llm Machine Learning Model Context Protocol

Alex Merced

3/29/2025 • EN

First Look at Reasoning From Scratch: Chapter 1

An introduction to reasoning in Large Language Models, covering key concepts like chain-of-thought and methods to improve LLM reasoning abilities.

artificial intelligence Deep Learning llm Machine Learning Reasoning

Sebastian Raschka

3/17/2025 • EN

GPU Programming from Scratch

An AI researcher shares her journey into GPU programming and introduces WebGPU Puzzles, a browser-based tool for learning GPU fundamentals from scratch.

AI Research Deep Learning Gpu Programming Neural Networks Webgpu

Jeremy Howard

9/1/2024 • EN

Building LLMs from the Ground Up: A 3-hour Coding Workshop

A 3-hour coding workshop video covering the implementation, training, and use of Large Language Models (LLMs) from scratch.

Coding Workshop Deep Learning llm Machine Learning Transformer Architecture

Sebastian Raschka

7/29/2024 • EN

Learning and Building Deep Neural Networks with Kotlin

A technical article exploring deep neural networks by comparing classic computational methods to modern ML, using sine function calculation as an example and implementing it in Kotlin.

Backpropagation Deep Learning Kotlin Machine Learning Neural Networks

Michael Inden

7/19/2024 • EN

CARTE: toward table foundation models

Introduces CARTE, a foundation model for tabular data, explaining its architecture, pretraining on knowledge graphs, and results.

Deep Learning Foundation Models Pretraining Relational Databases tabular data

Gael Varoquaux

7/11/2024 • EN

AI-Extracted Asian Building Footprints

Analysis of a research paper detailing an AI model that extracted 281 million building footprints from satellite imagery across East Asia.

computer vision data processing Deep Learning Geospatial AI github

Mark Litwintschik

7/11/2024 • EN

Questions about ARC Prize

An analysis of the ARC Prize AI benchmark, questioning if human-level intelligence can be achieved solely through deep learning and transformers.

artificial intelligence benchmark Deep Learning Neural Networks Transformer

Eric Jang

4/12/2024 • EN

Diffusion Models for Video Generation

Explores the application of diffusion models to video generation, covering technical challenges, parameterization, and sampling methods.

Deep Learning Diffusion Models generative ai Machine Learning Video Generation

Lilian Weng

7/1/2023 • EN

Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch

A guide to 9 PyTorch techniques for drastically reducing memory usage when training vision transformers and LLMs, enabling training on consumer hardware.

Deep Learning memory optimization Model Training Pytorch Transformers

Sebastian Raschka
