INDEX
Explanations
preparation
narratives involving romantic encounters and relationships.
The neuron fires on tokens describing preparatory or setup actions (e.g. “preparing,” “making,” “ensures,” “sets”) in a narrative.
New Auto-Interp
Negative Logits
daughter
-0.07
Body
-0.07
_suffix
-0.06
khóa
-0.06
indexed
-0.06
dropping
-0.06
-un
-0.06
,id
-0.06
evaluations
-0.05
�
-0.05
POSITIVE LOGITS
كيل
0.07
*g
0.07
.CompilerServices
0.07
501
0.06
ètres
0.06
DBNull
0.06
िछ
0.06
disastr
0.06
ريس
0.06
ffb
0.06
Activations Density 0.037%