INDEX
Explanations
the word "moment."
the presence of the substring "mo" in words
Negative Logits
Token        Logit
Ĭ            -0.64
Ash          -0.63
forged       -0.62
Parenthood   -0.61
Dat          -0.60
belonging    -0.59
Paragu       -0.59
graves       -0.59
deficit      -0.58
sockets      -0.58
Positive Logits
Token        Logit
mo           4.57
mos          1.90
MO           1.90
webkit       1.74
Mo           1.59
mo           1.54
mon          1.33
emo          1.30
mic          1.29
ma           1.29
Activations Density: 0.014%
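
As a rough sanity check on the second explanation (the substring "mo"), the sketch below counts how many of the ten positive-logit tokens listed above contain "mo", ignoring case. The token/value pairs are copied from this page; the variable names and the case-insensitive match are assumptions made for illustration, not part of the dashboard or of any auto-interp method.

```python
# Top positive-logit tokens and values, copied from the list on this page.
# Any leading whitespace in the original tokens is not recoverable here,
# so the two "mo" entries are kept as identical strings.
positive_logits = [
    ("mo", 4.57), ("mos", 1.90), ("MO", 1.90), ("webkit", 1.74), ("Mo", 1.59),
    ("mo", 1.54), ("mon", 1.33), ("emo", 1.30), ("mic", 1.29), ("ma", 1.29),
]


def contains_mo(token: str) -> bool:
    """Assumed matching rule: case-insensitive substring test for 'mo'."""
    return "mo" in token.lower()


matches = [(tok, val) for tok, val in positive_logits if contains_mo(tok)]
misses = [(tok, val) for tok, val in positive_logits if not contains_mo(tok)]

print(f"{len(matches)} of {len(positive_logits)} tokens contain 'mo'")
print("exceptions:", [tok for tok, _ in misses])
```

Under this matching rule, 7 of the 10 tokens contain "mo"; the exceptions are webkit, mic, and ma. That is broadly consistent with the substring explanation, though a list of ten tokens is too small to confirm it.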