Explanations
This neuron responds to second-person pronouns and phrases directly addressing the reader (e.g. “you,” “your”).
Negative Logits
Gods            -0.07
EMPTY           -0.07
(Event          -0.06
possibilities   -0.06
Esper           -0.06
안              -0.06
waves           -0.06
città           -0.06
pairs           -0.06
:Array          -0.06
Positive Logits
наче            0.07
-contained      0.07
ęki             0.07
loneliness      0.06
iliki           0.06
받              0.06
цем             0.06
_MIC            0.06
breadcrumb      0.06
اورپ            0.06
Activations Density 0.188%