INDEX
Explanations
mentions of military ranks and law enforcement titles
Negative Logits
ahime      -0.95
theless    -0.88
rights     -0.79
FORE       -0.70
GPU        -0.69
OPA        -0.69
topic      -0.67
trak       -0.66
bull       -0.66
phal       -0.65
Positive Logits
geant      1.21
Sgt        1.08
Sergeant   0.97
Maj        0.89
Lt         0.84
Pepper     0.82
gt         0.81
sergeant   0.81
Lance      0.80
chief      0.79
Activations Density 5.887%
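
For readers unfamiliar with the density figure, here is a minimal sketch of one common way such a number is computed: the fraction of sampled token positions on which the feature fires at all. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical; the page does not say how its own figure was obtained, so treat this as an illustration of the general idea rather than the tool's actual method.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    `activations` is a flat list of per-token activation values for one
    feature (an assumed input format, not taken from the page).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 3 of 8 token positions.
sample = [0.0, 0.0, 1.3, 0.0, 0.4, 0.0, 0.0, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 37.500%
```

Under this reading, a density of 5.887% would mean the feature is active on roughly 1 in 17 sampled positions.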