INDEX
Explanations
mentions of leaders
references to individuals in leadership positions
New Auto-Interp
Negative Logits
Pwr
-0.68
ITNESS
-0.67
Neighbor
-0.66
urses
-0.65
enegger
-0.64
Fusion
-0.63
gm
-0.61
tan
-0.60
belly
-0.59
agra
-0.58
POSITIVE LOGITS
boards
1.08
doms
1.07
board
0.93
hips
0.91
esses
0.90
strate
0.89
hip
0.88
leader
0.88
aroo
0.85
pins
0.84
Activations Density 0.046%