INDEX
Explanations
references to military actions or drone strikes
New Auto-Interp
Negative Logits
SourceChecksum
-0.74
+};
-0.59
Hochspringen
-0.57
abestanden
-0.56
abetes
-0.56
-------
-0.55
ilités
-0.54
OFDb
-0.53
PRWEB
-0.53
γων
-0.53
POSITIVE LOGITS
attack
1.81
attacks
1.63
attack
1.54
ATTACK
1.42
attacked
1.41
Attack
1.37
attacking
1.37
Attacks
1.31
attaque
1.27
attacks
1.25
Activations Density 0.204%