INDEX
Explanations
terms related to attackers and their actions
references to an "attacker" in various contexts
New Auto-Interp
Negative Logits
zl
-1.01
sit
-0.83
wheel
-0.82
Balt
-0.79
REE
-0.78
umph
-0.74
Revival
-0.73
carb
-0.70
urgical
-0.70
QL
-0.68
POSITIVE LOGITS
attackers
0.96
attacker
0.91
assailants
0.80
attacked
0.78
suspects
0.76
beware
0.75
rapist
0.73
assailant
0.72
attack
0.70
intent
0.70
Activations Density 0.029%