INDEX
Explanations
words related to political or organizational leadership
references to political or organizational leaders
New Auto-Interp
Negative Logits
nery
-0.70
ITNESS
-0.68
INGTON
-0.65
nder
-0.62
awar
-0.61
Oo
-0.61
Moor
-0.60
oute
-0.60
iverse
-0.59
isher
-0.59
POSITIVE LOGITS
hips
1.07
hip
1.03
doms
0.98
cius
0.87
paces
0.83
stration
0.83
wcs
0.81
negotiator
0.76
esses
0.73
pin
0.72
Activations Density 0.032%