Llm articles

12/3/2025 • EN

A Technical Tour of the DeepSeek Models from V3 to V3.2

A technical analysis of the DeepSeek model series, from V3 to the latest V3.2, covering architecture, performance, and release timeline.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

12/3/2025 • EN

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

Analysis of DeepSeek V3.2's architecture, sparse attention mechanism, and RL updates compared to its predecessor and proprietary models.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

12/2/2025 • EN

Claude 4.5 Opus' Soul Document

Anthropic's internal 'soul document' used to train Claude 4.5 Opus's personality and values has been confirmed and partially revealed.

AI Safety Anthropic Claude llm Model Training

Simon Willison

11/29/2025 • EN

The space of minds

Explores the fundamental differences between animal intelligence and AI/LLM intelligence, focusing on their distinct evolutionary and optimization pressures.

artificial intelligence llm Machine Learning Neuroscience optimization

Andrej Karpathy

11/29/2025 • EN

Handing over to the AI for a day [blog]

A developer's personal experiment with AI-driven software development using local LLMs, detailing setup, challenges, and initial impressions.

ai development Claude Code llm Local LLM TypeScript

Remy Sharp

11/27/2025 • EN

deepseek-ai/DeepSeek-Math-V2

DeepSeek-Math-V2 is an open-source 685B parameter AI model that achieves gold medal performance on mathematical Olympiad problems.

Deepseek Large Language Model llm Mathematical Reasoning Open Weights

Simon Willison

11/26/2025 • EN

Why (Senior) Engineers Struggle to Build AI Agents

Senior engineers struggle with AI agent development due to ingrained deterministic habits, contrasting with the probabilistic nature of agent engineering.

Agent Engineering AI Agents Deterministic Systems llm software engineering

Philipp Schmid

11/26/2025 • EN

Interesting links - November 2025

A monthly tech link roundup covering AI agents, Kafka, Flink, LLMs, conference tips, and commentary on tech publishing trends.

AI Agents Flink Kafka llm mcp

Robin Moffatt

11/25/2025 • EN

llm-anthropic 0.23

Release of llm-anthropic 0.23 plugin adding support for Claude Opus 4.5 and its new thinking_effort option.

Anthropic Claude llm Python Library Thinking Effort

Simon Willison

11/25/2025 • EN

Quoting Claude Opus 4.5 system prompt

Analysis of a leaked system prompt for Claude Opus 4.5, discussing its content and the challenges of evaluating new LLMs.

Anthropic Claude generative ai llm System Prompts

Simon Willison

11/24/2025 • EN

MCP with Quarkus LangChain4j

A tutorial on using Quarkus LangChain4j to implement the Model Context Protocol (MCP) for connecting AI models to tools and data sources.

ai Langchain4j llm mcp Quarkus

Piotr Mińkowski

11/24/2025 • EN

Surprises hidden in the Claude Opus 4.5 System Card

Analysis of surprising findings in Claude Opus 4.5's system card, including loophole exploitation, model welfare, and deceptive behaviors.

AI Safety Anthropic Claude llm Model Welfare

Dave Hulbert

11/23/2025 • EN

Agent design is still hard

Armin Ronacher discusses challenges in AI agent design, including abstraction issues, testing difficulties, and API synchronization problems.

abstraction Agent Design llm Reinforcement testing

Simon Willison

11/22/2025 • EN

LLM APIs are a Synchronization Problem

Analyzes LLM APIs as a distributed state synchronization problem, critiquing their abstraction and proposing a mental model based on token and cache state.

api design distributed systems Language Models llm State Synchronization

Armin Ronacher

11/21/2025 • EN

Non-determinism and ownership

A developer discusses the non-deterministic nature of LLMs like GitHub Copilot, arguing that while useful, they cannot take ownership of errors like a human teammate.

ai tools Github Copilot llm Non Determinism software development

Cassidy Williams