INDEX
Explanations
mentions of people's former professional positions and affiliations
references to individuals' previous roles or positions
New Auto-Interp
Negative Logits
anium
-0.83
llers
-0.81
Rs
-0.80
places
-0.78
abouts
-0.78
ombies
-0.77
tics
-0.77
houses
-0.76
fixes
-0.76
iths
-0.75
POSITIVE LOGITS
staffer
1.17
employee
1.15
classmate
1.14
aide
1.13
member
1.11
adviser
1.10
prosecutor
1.09
colleague
1.06
deputy
1.02
diplomat
1.02
Activations Density 0.076%