INDEX
Explanations
phrases related to fear
expressions of fear or apprehension
New Auto-Interp
Negative Logits
urgy
-0.88
versions
-0.87
artisan
-0.85
arb
-0.85
ced
-0.83
version
-0.82
ergy
-0.75
char
-0.75
co
-0.75
ighters
-0.74
POSITIVE LOGITS
feared
1.06
lessly
1.00
fears
0.86
lest
0.86
Ily
0.81
afraid
0.79
fearing
0.78
abroad
0.76
risome
0.75
aloud
0.74
Activations Density 0.005%