Llm articles

12/10/2025 • EN

Devstral 2

Mistral AI releases Devstral 2 and Devstral Small 2, two new open models focused on powering coding agents and software development tasks.

Coding Agents Devstral 2 llm Mistral AI Open Model

Simon Willison

12/10/2025 • EN

Under the hood of Canada Spends with Brendan Samek

An interview about Canada Spends, a project using Datasette, SQLite, and LLMs to make Canadian government financial data accessible and explorable.

data visualization Datasette llm Pdf Extraction sqlite

Simon Willison

12/9/2025 • EN

Quoting Claude

A blog post analyzing a critical bug in Claude Code where a command accidentally deleted a user's home directory.

ai ethics Claude Code Coding Agents generative ai llm

Simon Willison

12/9/2025 • EN

Prediction: AI will make formal verification go mainstream

AI is predicted to bring formal verification tools like Dafny and Verus into mainstream use, aided by LLMs making them more accessible.

ai formal verification llm programming languages software development

Simon Willison

12/7/2025 • EN

Using LLMs at Oxide

Bryan Cantrill discusses applying Large Language Models (LLMs) at Oxide, evaluating them against the company's core values.

ai ethics generative ai llm Oxide software development

Simon Willison

12/7/2025 • EN

Quoting David Crespo

Tips from David Crespo on effectively using Claude Code for understanding codebases and automating tedious coding tasks.

Claude Code Context Management llm Programming Workflow software development

Simon Willison

12/4/2025 • EN

Context Engineering for AI Agents: Part 2

Explores advanced Context Engineering techniques for AI agents, focusing on combating Context Rot and improving multi-agent coordination.

Agent Harness AI Agents Context Engineering llm Multi Agent Systems

Philipp Schmid

12/3/2025 • EN

From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates

Analysis of DeepSeek V3.2's architecture, sparse attention mechanism, and RL updates compared to its predecessor and proprietary models.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

12/3/2025 • EN

A Technical Tour of the DeepSeek Models from V3 to V3.2

A technical analysis of the DeepSeek model series, from V3 to the latest V3.2, covering architecture, performance, and release timeline.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

12/2/2025 • EN

Claude 4.5 Opus' Soul Document

Anthropic's internal 'soul document' used to train Claude 4.5 Opus's personality and values has been confirmed and partially revealed.

AI Safety Anthropic Claude llm Model Training

Simon Willison

11/29/2025 • EN

The space of minds

Explores the fundamental differences between animal intelligence and AI/LLM intelligence, focusing on their distinct evolutionary and optimization pressures.

artificial intelligence llm Machine Learning Neuroscience optimization

Andrej Karpathy

11/29/2025 • EN

Handing over to the AI for a day [blog]

A developer's personal experiment with AI-driven software development using local LLMs, detailing setup, challenges, and initial impressions.

ai development Claude Code llm Local LLM TypeScript

Remy Sharp

11/27/2025 • EN

deepseek-ai/DeepSeek-Math-V2

DeepSeek-Math-V2 is an open-source 685B parameter AI model that achieves gold medal performance on mathematical Olympiad problems.

Deepseek Large Language Model llm Mathematical Reasoning Open Weights

Simon Willison

11/26/2025 • EN

Interesting links - November 2025

A monthly tech link roundup covering AI agents, Kafka, Flink, LLMs, conference tips, and commentary on tech publishing trends.

AI Agents Flink Kafka llm mcp

Robin Moffatt

11/26/2025 • EN

Why (Senior) Engineers Struggle to Build AI Agents

Senior engineers struggle with AI agent development due to ingrained deterministic habits, contrasting with the probabilistic nature of agent engineering.

Agent Engineering AI Agents Deterministic Systems llm software engineering

Philipp Schmid