INDEX
Explanations
This neuron selectively activates on the phrase “meant to (be)”, i.e. the words “meant to” signaling destiny or purpose.
New Auto-Interp
Negative Logits
achine
-0.07
cade
-0.07
.chars
-0.07
royalties
-0.07
při
-0.06
igration
-0.06
-auto
-0.06
.listBox
-0.06
gorith
-0.06
_TOTAL
-0.06
POSITIVE LOGITS
meant
0.13
Must
0.08
means
0.07
beloved
0.07
doesnt
0.07
0.07
leaflet
0.06
facilitated
0.06
deliberate
0.06
меня
0.06
Activations Density 0.005%