Explanations
threats and coercion
The neuron activates specifically on the past‐tense auxiliary “did” (including “didn’t”).
Negative Logits
mexico        -0.07
(Layout       -0.07
173           -0.07
-awaited      -0.06
.dep          -0.06
.Microsoft    -0.06
giải          -0.06
],"           -0.06
Invocation    -0.06
miştir        -0.06
Positive Logits
HANDLE        0.06
حکم           0.06
NaN           0.06
EOS           0.06
gameOver      0.06
presente      0.06
Vapor         0.06
golden        0.06
enclave       0.06
fır           0.06
Activations Density 0.024%
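
The density figure above is not defined on this page. A common reading is the percentage of sampled tokens on which the neuron has a non-zero activation, and the sketch below illustrates that reading only. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical; the dashboard's actual computation may differ.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    neuron fires at all. This is an illustrative definition, not one
    confirmed by the source page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Made-up activations: the neuron fires on 1 of 10,000 tokens.
sample = [0.0] * 9999 + [1.3]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.024% would correspond to the neuron firing on roughly 24 of every 100,000 sampled tokens.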