INDEX
Explanations
words related to fear or being afraid
instances of the word "feared" indicating anxiety or concern
New Auto-Interp
Negative Logits
arb
-0.96
urgy
-0.91
arist
-0.84
versions
-0.84
artisan
-0.81
ced
-0.81
version
-0.80
ysis
-0.79
ighters
-0.75
ammy
-0.75
POSITIVE LOGITS
lessly
0.98
feared
0.86
fears
0.84
lest
0.82
Ily
0.80
afraid
0.79
Esther
0.71
fearing
0.71
fully
0.70
FUL
0.69
Activations Density 0.009%