INDEX
Explanations
references to military personnel and symbols related to them
New Auto-Interp
Negative Logits
RTC
-0.16
police
-0.15
OPS
-0.15
erton
-0.14
WithValue
-0.14
Police
-0.14
assis
-0.14
.ud
-0.14
distraction
-0.13
cancel
-0.13
POSITIVE LOGITS
POW
0.51
prisoner
0.49
captured
0.48
prisoners
0.46
captive
0.45
capture
0.44
capt
0.44
Prison
0.43
captures
0.43
captivity
0.42
Activations Density 0.079%