INDEX
Explanations
words related to leadership positions or control
terms related to leadership and control positions
New Auto-Interp
Negative Logits
Interstitial
-0.91
Decre
-0.79
IMAGES
-0.75
Pse
-0.72
Americans
-0.70
ramid
-0.68
Memory
-0.67
birth
-0.66
adra
-0.65
Ident
-0.65
POSITIVE LOGITS
helm
1.41
idon
0.80
reins
0.78
fish
0.76
atop
0.70
spoiler
0.70
chair
0.70
eering
0.70
rigging
0.69
oint
0.69
Activations Density 0.007%