INDEX
Explanations
words related to fear, dread, or anxiety
expressions of fear and anxiety
Negative Logits
cius            -0.75
Æ               -0.68
Reviewer        -0.67
OECD            -0.67
venants         -0.66
ropri           -0.66
Transparency    -0.65
Nile            -0.65
udi             -0.62
ARB             -0.61
Positive Logits
locks           1.28
locked          0.97
fully           0.94
etheless        0.91
eful            0.82
mare            0.81
dread           0.80
nant            0.79
mares           0.78
fulness         0.76
Activations Density 0.029%