INDEX
Explanations
phrases related to fear and anxiety
references to fear and its implications
New Auto-Interp
Negative Logits
nice
-0.80
odore
-0.76
authenticated
-0.69
anmar
-0.68
ilib
-0.64
arbon
-0.64
arkable
-0.63
asty
-0.62
inka
-0.61
cise
-0.60
POSITIVE LOGITS
mong
1.44
lessness
1.34
lessly
1.27
fulness
1.17
crow
1.08
fully
1.04
wart
0.97
ful
0.91
warts
0.91
lest
0.89
Activations Density 0.032%