INDEX
Explanations
words related to fear
instances of the word "fear" in various contexts
New Auto-Interp
Negative Logits
nice
-0.90
endum
-0.74
issance
-0.71
cise
-0.70
arkable
-0.70
authenticated
-0.68
afort
-0.67
eret
-0.67
ded
-0.67
arius
-0.66
POSITIVE LOGITS
mong
1.25
lessly
1.22
lessness
1.11
crow
0.97
fulness
0.95
lest
0.90
fully
0.88
Mong
0.81
retaliation
0.78
wart
0.77
Activations Density 0.033%