INDEX
Explanations
phrases related to rules, authority, power, or control
terms associated with authority and leadership transitions
Negative Logits
ertodd      -0.71
hammad      -0.71
contrace    -0.61
FFER        -0.61
Humanity    -0.61
Quotes      -0.61
defe        -0.59
Gre         -0.59
Kin         -0.58
endix       -0.58
Positive Logits
pin         0.89
s           0.88
ited        0.84
uin         0.79
nant        0.78
esses       0.78
pins        0.78
unders      0.77
ping        0.77
iever       0.75
Activations Density 0.015%