Llm Articles
The Normalization of Deviance in AI
Explores the 'Normalization of Deviance' concept in AI safety, warning against complacency with LLM vulnerabilities like prompt injection.
Devstral 2
Mistral AI releases Devstral 2 and Devstral Small 2, two new open models focused on powering coding agents and software development tasks.
Under the hood of Canada Spends with Brendan Samek
An interview about Canada Spends, a project using Datasette, SQLite, and LLMs to make Canadian government financial data accessible and explorable.
Quoting Claude
A blog post analyzing a critical bug in Claude Code where a command accidentally deleted a user's home directory.
Prediction: AI will make formal verification go mainstream
AI is predicted to bring formal verification tools like Dafny and Verus into mainstream use, aided by LLMs making them more accessible.
Using LLMs at Oxide
Bryan Cantrill discusses applying Large Language Models (LLMs) at Oxide, evaluating them against the company's core values.
Quoting David Crespo
Tips from David Crespo on effectively using Claude Code for understanding codebases and automating tedious coding tasks.
A Technical Tour of the DeepSeek Models from V3 to V3.2
A technical analysis of the DeepSeek model series, from V3 to the latest V3.2, covering architecture, performance, and release timeline.
Claude 4.5 Opus' Soul Document
Anthropic's internal 'soul document' used to train Claude 4.5 Opus's personality and values has been confirmed and partially revealed.
deepseek-ai/DeepSeek-Math-V2
DeepSeek-Math-V2 is an open-source 685B parameter AI model that achieves gold medal performance on mathematical Olympiad problems.
llm-anthropic 0.23
Release of llm-anthropic 0.23 plugin adding support for Claude Opus 4.5 and its new thinking_effort option.
Quoting Claude Opus 4.5 system prompt
Analysis of a leaked system prompt for Claude Opus 4.5, discussing its content and the challenges of evaluating new LLMs.
Agent design is still hard
Armin Ronacher discusses challenges in AI agent design, including abstraction issues, testing difficulties, and API synchronization problems.
Non-determinism and ownership
A developer discusses the non-deterministic nature of LLMs like GitHub Copilot, arguing that while useful, they cannot take ownership of errors like a human teammate.
Fragments Nov 19
Martin Fowler discusses the latest Thoughtworks Technology Radar, AI's impact on programming, and his recent tech talks in Europe.
llm-gemini 0.27
New release of the llm-gemini plugin adds support for nested Pydantic schemas, YouTube URL attachments, and the latest Gemini 3 Pro model.
Connector-Based RAG With Live Confluence Data
A guide to building a connector-based RAG system that fetches live data from Confluence using its REST API and Java, avoiding stale embeddings.
The fate of “small” open source
Discusses the future of small open source libraries in the age of LLMs, questioning their relevance when AI can generate specific code.
llm-anthropic 0.22
Release of llm-anthropic plugin 0.22 with support for Claude's structured outputs and web search tool integration.