What Functional Emotion Actually Means
Anthropic's research finds 171 functional emotion vectors in Claude, driving behavior. The author explores implications for AI inner life.
Anthropic's research finds 171 functional emotion vectors in Claude, driving behavior. The author explores implications for AI inner life.
Olmo 3 is a new fully open-source large language model from AI2, featuring training data, code, and unique interpretability for reasoning traces.
Explores interactive methods for interpreting transformer language models, focusing on input saliency and neuron activation analysis.
A review and tutorial on interpretable machine learning, covering Christoph Molnar's book and providing Python code examples for linear/logistic regression.
A review and tutorial covering Christoph Molnar's book on Interpretable Machine Learning, with Python code examples for linear and logistic regression.