Explanations
terms associated with fear or anxiety
Negative Logits
Token     Logit
sheer     -0.16
cott      -0.15
ää        -0.15
Äįka      -0.15
zers      -0.15
lass      -0.14
preter    -0.14
emme      -0.13
maids     -0.13
orest     -0.13
Positive Logits
Token     Logit
sdale     0.17
ific      0.17
etz       0.16
etched    0.16
edback    0.16
ified     0.15
chwitz    0.15
opus      0.15
iej       0.15
IENCE     0.15
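
The page does not say how the two logit lists are produced. A minimal sketch, assuming each value is the feature's effect on one output token's logit and that the lists are simply the k most negative and k most positive entries, could look like this. The function name `top_logit_effects` and the `effects` dictionary are hypothetical; the sample values are copied from the lists above.

```python
def top_logit_effects(effects, k=10):
    """Return (negative, positive) lists of (token, value) pairs.

    Assumption: `effects` maps each token string to the feature's
    effect on that token's output logit. This is an illustrative
    helper, not the dashboard's actual code.
    """
    ranked = sorted(effects.items(), key=lambda kv: kv[1])
    # Most negative first.
    negative = [(tok, val) for tok, val in ranked if val < 0][:k]
    # Most positive first.
    positive = [(tok, val) for tok, val in reversed(ranked) if val > 0][:k]
    return negative, positive


# A few values taken from the lists above.
sample = {
    "sheer": -0.16, "cott": -0.15, "lass": -0.14,
    "sdale": 0.17, "etz": 0.16, "opus": 0.15,
}
negative, positive = top_logit_effects(sample, k=2)
print(negative)  # [('sheer', -0.16), ('cott', -0.15)]
print(positive)  # [('sdale', 0.17), ('etz', 0.16)]
```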
Activations Density 0.118%
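
The density figure is not defined on the page. One common reading, assumed here, is the fraction of token positions on which the feature's activation is above zero. The sketch below uses a hypothetical helper `activation_density` and made-up toy activations, not data from this feature.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of positions whose activation exceeds `threshold`.

    Assumption: density means "share of tokens on which the feature
    fires"; the threshold of 0.0 is also an assumption.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Toy data: 10,000 positions, 12 of which have a nonzero activation.
acts = [0.0] * 10_000
for i in range(12):
    acts[i * 100] = 1.5

print(f"{activation_density(acts):.3%}")  # 0.120%
```

Under this reading, a density of 0.118% would mean the feature fires on roughly 1 in 850 tokens.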