INDEX
Explanations
Thinking/calculation
The neuron detects “chain‐of‐thought” prompt language—phrases like “think … step by step.”
New Auto-Interp
Negative Logits
.faceVertexUvs
-0.07
icts
-0.07
-turn
-0.07
规范
-0.07
Playlist
-0.06
распрост
-0.06
enn
-0.06
ி
-0.06
层
-0.06
进一步
-0.06
POSITIVE LOGITS
Hang
0.06
Sexual
0.06
eb
0.06
relação
0.06
(dynamic
0.06
yolc
0.06
Sergio
0.06
0.06
ンス
0.06
Pot
0.06
Activations Density 0.002%