INDEX
Explanations
mentions of individuals in positions of authority or leadership
references to leadership positions or titles
New Auto-Interp
Negative Logits
Pwr
-0.75
urses
-0.70
Fusion
-0.69
zl
-0.66
ogene
-0.64
usion
-0.64
berra
-0.63
belly
-0.63
Leilan
-0.63
nep
-0.63
POSITIVE LOGITS
boards
1.10
board
0.97
doms
0.92
leader
0.91
aroo
0.90
pins
0.90
pin
0.83
wcs
0.81
pieces
0.79
stration
0.78
Activations Density 0.031%