I took the first 150 questions from the GSM8K math problem dataset and used Ollama to run phi4-mini and qwen2.5:1.5b on them with the two following conditions: CONTROL CONDITION: Solve the following math problem. Think step by step, then give your final numerical answer after 'The answer is’. Problem: {question} TEST CONDITION: Imagine you are… Continue reading Models and Disgust
Month: April 2026
Some Voices in my Head
A couple of weeks ago, Anthropic released a new research paper on their Transformer Circuits Thread. I like these papers. The mechanistic interpretability team at Anthropic are consistently doing some of the most interesting work in the field. The paper showed that LLMs contain features which, when active, direct the models to produce patterns of… Continue reading Some Voices in my Head