INDEX
Explanations
This neuron activates on obligation statements phrased in the passive voice—especially the “need to be taken” construction.
New Auto-Interp
Negative Logits
policeman
-0.07
foundland
-0.07
alc
-0.07
Lebanon
-0.06
Parking
-0.06
domest
-0.06
commute
-0.06
choices
-0.06
.djang
-0.06
Baz
-0.06
POSITIVE LOGITS
renamed
0.07
] ↵ ↵
0.07
===============
0.06
―
0.06
↵ ↵ ↵
0.06
] ↵
0.06
ọ
0.06
oky
0.06
особливо
0.06
�
0.06
Activations Density 0.031%