INDEX
Explanations
words related to power, control, and authority
terms related to dominance and control in various contexts
New Auto-Interp
Negative Logits
ead
-0.77
enegger
-0.74
pt
-0.69
abet
-0.67
idan
-0.67
endment
-0.67
ange
-0.66
Afee
-0.64
den
-0.64
gnu
-0.63
POSITIVE LOGITS
mentality
0.71
hierarch
0.71
bidder
0.70
mindset
0.69
headlines
0.68
precedence
0.67
dominate
0.67
charge
0.66
dominated
0.65
mind
0.64
Activations Density 0.078%