How to think about evals
A blog post summarizing key concepts from an AI Evals course, focusing on mental models like the 'Three Gulfs' for improving LLM applications.
A blog post summarizing key concepts from an AI Evals course, focusing on mental models like the 'Three Gulfs' for improving LLM applications.
An analysis of key qualities that define excellent non-corporate technical blogs, including tackling complex topics and showing working code.
Explains second-order thinking, a mental model for considering long-term consequences of actions, with examples from software engineering and management.
Explores how unexpected software behavior, or 'That's funny...' moments, are key opportunities for learning and improving mental models of code.
A reflection on how differing personal and professional contexts shape decision-making, especially in software architecture and team collaboration.