INDEX
Explanations
phrases related to leadership or individuals in positions of authority
occurrences of the word "leader."
New Auto-Interp
Negative Logits
Neighbor
-0.66
\/\/
-0.64
ascade
-0.63
ITNESS
-0.62
Pwr
-0.61
ALLY
-0.60
agra
-0.59
Fusion
-0.59
LV
-0.58
Collider
-0.57
POSITIVE LOGITS
boards
1.37
board
1.26
doms
1.09
pins
0.94
leader
0.93
aroo
0.92
hips
0.90
pin
0.90
esses
0.90
less
0.87
Activations Density 0.067%