Explanations
expressions of fear or anxiety
Negative Logits
weis      -0.18
elor      -0.17
icher     -0.16
esto      -0.15
cznie     -0.15
ills      -0.15
manship   -0.14
iferay    -0.14
eyer      -0.14
vre       -0.14
Positive Logits
hã        0.28
mong      0.18
scare     0.18
Fear      0.18
怖        0.18
scared    0.17
lessly    0.17
怕        0.17
Howell    0.16
Fear      0.16
Activations Density 0.019%
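
The density figure above can be read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading under stated assumptions: the definition (activation strictly greater than zero), the function name `activation_density`, and the sample data are all hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens with a
    non-zero activation; the dashboard's exact definition may differ.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 19 of 100,000 tokens.
sample = [1.5] * 19 + [0.0] * 99_981
density = activation_density(sample)
print(f"{density:.3%}")
```

On this reading, a density of 0.019% corresponds to roughly 19 active tokens per 100,000 sampled, which fits a sparse feature tied to a narrow concept such as expressions of fear or anxiety.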