INDEX
Explanations
references to positions of authority or titles in an organization
Negative Logits
Token       Logit
ced         -0.16
cede        -0.15
inski       -0.15
adol        -0.14
Aires       -0.14
.Builder    -0.14
ути         -0.14
schö        -0.13
Af          -0.13
oyer        -0.13
Positive Logits
Token       Logit
evt         0.16
اذ          0.15
ifa         0.15
pike        0.15
eba         0.14
oldt        0.14
sville      0.14
565         0.14
uncture     0.14
sole        0.14
Activations Density: 0.050%