INDEX
Explanations
mentions of police or military ranks
mentions of military ranks, specifically "Sgt" (Sergeant)
New Auto-Interp
Negative Logits
ahime
-0.80
theless
-0.78
rights
-0.75
女
-0.73
phal
-0.71
stage
-0.70
FORE
-0.68
hur
-0.66
topic
-0.65
1080
-0.65
POSITIVE LOGITS
geant
1.24
Sgt
1.13
Sergeant
0.92
gt
0.92
sergeant
0.90
veland
0.83
Pepper
0.82
Maj
0.79
imen
0.79
illery
0.78
Activations Density 0.020%