Explanations
code snippets
The neuron is primarily picking up on first-person references (“I”, “am”, “not”, etc.) and self-descriptive statements by the author.
Negative Logits
Token        Logit
427          -0.06
Memphis      -0.06
Barry        -0.06
Mu           -0.06
Symptoms     -0.06
forc         -0.06
Mu           -0.06
cis          -0.06
MH           -0.06
�            -0.06
Positive Logits
Token        Logit
_finder      0.06
bakımından   0.06
CREATED      0.06
RO           0.06
оген         0.06
empo         0.06
olo          0.06
deceit       0.06
producto     0.06
convolution  0.06
Activations Density 0.092%
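
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. The page does not say how it defines activation density, so the definition used here (the share of token positions where the activation exceeds a threshold) is an assumption, and the function name, the threshold, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumed definition: the dashboard's exact formula is not given in the page.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 100,000 token positions, 92 of which fire.
sample = [0.0] * 100_000
for i in range(92):
    sample[i * 1000] = 1.5

density = activation_density(sample)
print(f"Activations Density {density:.3f}%")
```

Under this assumed definition, a density of 0.092% means the neuron fires on roughly 92 of every 100,000 token positions, so it is active only rarely.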