INDEX
Explanations
references to leadership roles and titles in various contexts
New Auto-Interp
Negative Logits
wap
-0.14
Interceptor
-0.14
udas
-0.14
osti
-0.13
colleagues
-0.13
alian
-0.13
iate
-0.13
thood
-0.13
antages
-0.13
Economist
-0.13
POSITIVE LOGITS
driving
0.37
brains
0.33
brains
0.28
force
0.28
-driving
0.28
Driving
0.27
co
0.27
creator
0.25
inst
0.24
Driving
0.23
Activations Density 0.135%