INDEX
Explanations
mentions of criminal incidents or accidents
references to criminal incidents and violence
New Auto-Interp
Negative Logits
izons
-0.80
bundles
-0.75
portfolios
-0.75
ocobo
-0.74
docs
-0.74
vana
-0.73
reserves
-0.72
ngth
-0.71
Tokens
-0.70
ophy
-0.69
POSITIVE LOGITS
perpetrated
1.03
occurred
1.00
escalate
0.99
caused
0.94
occurring
0.93
Scene
0.91
tragic
0.91
involving
0.89
unsolved
0.89
occur
0.88
Activations Density 0.504%