INDEX
Explanations
references to strong negative emotions or concerns
instances of the word "fears" and its associated context
New Auto-Interp
Negative Logits
ced
-0.77
thumbnails
-0.75
arb
-0.74
nice
-0.74
urgy
-0.74
sample
-0.70
lease
-0.69
ann
-0.69
ergy
-0.68
cise
-0.68
POSITIVE LOGITS
fears
1.20
wart
0.87
lessly
0.87
lest
0.81
worries
0.75
warts
0.75
perceptions
0.74
fear
0.73
mong
0.73
fearing
0.72
Activations Density 0.007%