INDEX
Explanations
words related to courage and bravery
New Auto-Interp
Negative Logits
urgy
-0.72
foreseen
-0.65
upon
-0.65
administrations
-0.63
arate
-0.63
dated
-0.63
ipal
-0.62
opathy
-0.62
Controlled
-0.62
apse
-0.61
POSITIVE LOGITS
ly
1.16
souls
1.00
heart
0.94
faced
0.91
enough
0.84
brave
0.84
glers
0.79
courage
0.73
ness
0.73
gest
0.70
Activations Density 0.012%