INDEX
Explanations
mentions of individuals in positions of authority or leadership, particularly those holding the title "deputy."
New Auto-Interp
Negative Logits
ulan
-0.16
kla
-0.16
_utilities
-0.15
usk
-0.15
Ñįлек
-0.14
绩
-0.14
{?>↵-0.14
zones
-0.14
orro
-0.14
ãĢģãĢģ
-0.14
POSITIVE LOGITS
ature
0.16
ship
0.16
ure
0.15
/full
0.15
ships
0.15
ny
0.14
ido
0.14
bh
0.14
ats
0.14
cum
0.14
Activations Density 0.026%