INDEX
Explanations
instances of violence and threats
attacks or invaders
hostile actions and actors
New Auto-Interp
Negative Logits
]")]
-0.50
للاسماء
-0.47
wikipagina
-0.46
Signalez
-0.45
LLocation
-0.45
GOTREF
-0.44
vixion
-0.44
Connectez
-0.43
SourceChecksum
-0.42
embarazada
-0.42
POSITIVE LOGITS
threats
0.54
CodedInputStream
0.50
attackers
0.49
attack
0.48
thieves
0.48
predators
0.47
attacks
0.47
assailants
0.46
ladr
0.45
hackers
0.44
Activations Density 0.362%