INDEX
Explanations
words related to fear and concern
expressions of apprehension or concern
New Auto-Interp
Negative Logits
issance
-0.83
estone
-0.79
nice
-0.77
putable
-0.75
afort
-0.72
emark
-0.72
endum
-0.72
ysis
-0.69
versions
-0.68
exc
-0.68
POSITIVE LOGITS
lessly
1.03
lest
1.00
retribution
0.90
retaliation
0.80
repercussions
0.74
fears
0.73
complicity
0.71
jeopard
0.70
afraid
0.69
Esther
0.69
Activations Density 0.034%