Explanations
The neuron activates on occurrences of the first‐person pronoun “I.”
Negative Logits
�示          -0.07
Numbers      -0.07
ftp          -0.07
lx           -0.07
.AreEqual    -0.07
.fits        -0.06
(filepath    -0.06
BAT          -0.06
shipping     -0.06
_short       -0.06
Positive Logits
inform       0.06
threatening  0.06
Amer         0.06
าค           0.06
nikdo        0.06
Hook         0.06
Brittany     0.06
IRC          0.06
%M           0.06
pekt         0.06
Activations Density 0.015%
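
The density figure can be read as the share of token positions on which the neuron fires. The sketch below is a minimal illustration under that assumption; the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means (number of active tokens) / (total tokens).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 3 active positions out of 20,000 tokens.
acts = [0.0] * 19997 + [1.2, 0.4, 2.5]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.015%
```

With this reading, a density of 0.015% corresponds to roughly 15 active tokens per 100,000, which fits a neuron tied to a single specific token.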