Why the answer is D
I whispered a wrong answer to a reasoning model and probed its residual stream to find out when it actually decides. Spoiler: sometimes before it starts "thinking."
A dial for a language model's mind
A single direction added to an LLM's activations dials a behavior up and down like a knob. Extracted on one 4090, running live in your browser. Drag the slider.
A stupid amount of work for a date
Rebuilding the lantern scene from Tangled as a walkable 3D world you can stand inside. Gaussian splats, a 480p ceiling, and one genuinely cursed NCCL error.