Deepseek articles

12/30/2025 • EN

The State Of LLMs 2025: Progress, Problems, and Predictions

A 2025 year-in-review analysis of large language models (LLMs), covering key developments in reasoning, architecture, costs, and predictions for 2026.

artificial intelligence Deepseek llm Machine Learning Reasoning Models

Sebastian Raschka

12/30/2025 • EN

The State Of LLMs 2025: Progress, Problems, and Predictions

A 2025 year-in-review of Large Language Models, covering major developments in reasoning, architecture, costs, and predictions for 2026.

AI Research Deepseek llm Reasoning Models Reinforcement Learning

Sebastian Raschka

12/3/2025 • EN

A Technical Tour of the DeepSeek Models from V3 to V3.2

A technical analysis of the DeepSeek model series, from V3 to the latest V3.2, covering architecture, performance, and release timeline.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

12/3/2025 • EN

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

A technical analysis of DeepSeek V3.2's architecture, sparse attention, and reinforcement learning updates, comparing it to other flagship AI models.

Deepseek LLM Architecture Open Weight Models Reinforcement Learning Sparse Attention

Sebastian Raschka

12/3/2025 • EN

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

Analysis of DeepSeek V3.2's architecture, sparse attention mechanism, and RL updates compared to its predecessor and proprietary models.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

11/27/2025 • EN

deepseek-ai/DeepSeek-Math-V2

DeepSeek-Math-V2 is an open-source, 685B-parameter model that achieves gold-medal-level performance on mathematical Olympiad problems.

Deepseek Large Language Model llm Mathematical Reasoning Open Weights

Simon Willison

6/20/2025 • EN

AIs that break down questions reason better

Explores how advanced AIs use chain-of-thought reasoning to break complex problems into simpler steps, improving accuracy and performance.

artificial intelligence Conversational AI Deepseek Language Models Reasoning Models

Gael Varoquaux

2/26/2025 • EN

Running DeepSeek open reasoning models on GKE

A technical guide on deploying DeepSeek's open reasoning AI models on Google Kubernetes Engine (GKE) using vLLM and a Gradio interface.

Deepseek GKE GPU Inference Kubernetes

William Denniss

2/5/2025 • EN

Understanding Reasoning LLMs

Explores four main approaches to building and enhancing reasoning capabilities in Large Language Models (LLMs) for complex tasks.

Deepseek LLM Reasoning Model Specialization Reinforcement Learning Supervised Finetuning

Sebastian Raschka

1/28/2025 • EN

Install Deepseek on Linux

A tutorial on installing a DeepSeek model locally on a Linux machine using Ollama.

ai Deepseek linux llm Ollama

Marko Denic

1/17/2025 • EN

Bite: How Deepseek R1 was trained

Explains the training of DeepSeek-R1, focusing on the Group Relative Policy Optimization (GRPO) reinforcement learning method.

Deepseek GRPO LLM Training Proximal Policy Optimization Reinforcement Learning

Philipp Schmid
