INDEX
Explanations
words related to fear
mentions of fear and its related concepts
New Auto-Interp
Negative Logits
nice
-0.82
arb
-0.77
eret
-0.74
arkable
-0.72
authenticated
-0.69
urgy
-0.68
unes
-0.68
endum
-0.68
issance
-0.68
dates
-0.67
POSITIVE LOGITS
lessly
1.31
mong
1.19
lessness
1.14
crow
1.08
fulness
1.07
fully
1.03
lest
0.89
wart
0.78
FUL
0.77
ingly
0.76
Activations Density 0.029%