Explanations
This neuron responds to occurrences of the third-person masculine pronoun “he.”
Negative Logits
Token        Logit
strcat       -0.07
講           -0.07
_eng         -0.07
pode         -0.06
pomocí       -0.06
noveller     -0.06
alone        -0.06
Algorithms   -0.06
Options      -0.06
processo     -0.06
Positive Logits
Token        Logit
Gemini       0.08
的地         0.07
setType      0.06
vigorous     0.06
ัม           0.06
ALIGN        0.06
.City        0.06
?",          0.06
kiện         0.06
ashion       0.06
Activations Density: 0.031%