INDEX
Explanations
explicit references to military actions and tactics
Negative Logits
abin          -0.18
adia          -0.16
VRT           -0.16
داخلی         -0.15
okres         -0.15
رق            -0.15
atoi          -0.15
ADR           -0.15
apos          -0.14
unintended    -0.14
Positive Logits
attack        0.56
attacks       0.45
attack        0.41
Attack        0.39
assault       0.38
strike        0.34
Attacks       0.34
attacks       0.33
assaults      0.33
ATTACK        0.33
Activations Density 0.200%
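
One way to read the density figure, assuming it is the fraction of token positions on which the feature activates at all: a density of 0.200% would correspond to roughly 2 active positions per 1,000 tokens. The sketch below illustrates that reading only. The function name `activation_density`, the zero threshold, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions with a nonzero
    activation; the dashboard itself does not state its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, 2 of which activate the feature.
sample = [0.0] * 998 + [1.3, 0.7]
density = activation_density(sample)
print(f"{density:.3%}")  # 0.200%
```

Under this reading, a low density suggests the feature is sparse: it stays silent on almost all text and fires on a narrow set of contexts.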