Explanations
references to criminal activities and legal proceedings surrounding a crime
Negative Logits
leaf            -0.68
ooks            -0.64
ove             -0.64
princip         -0.64
authoritarian   -0.63
efficiency      -0.62
raltar          -0.60
taxing          -0.60
binding         -0.59
andel           -0.59
Positive Logits
istics          0.88
izes            0.88
izer            0.85
izers           0.82
vict            0.81
ization         0.78
Victim          0.77
inflicted       0.77
victim          0.77
hood            0.76
Activations Density 0.030%