INDEX
Explanations
descriptions of criminal activities or events
references to criminal activities and incidents involving victims
Negative Logits
Grade         -0.71
®             -0.68
ivably        -0.68
!]            -0.67
erning        -0.67
2020          -0.67
folio         -0.66
ularity       -0.64
!)            -0.64
Prediction    -0.64
Positive Logits
orally        0.99
smelled       0.91
mol           0.89
Cosby         0.86
"'            0.84
intoxicated   0.83
verbally      0.82
panicked      0.82
abused        0.82
raped         0.81
Activations Density 0.572%
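
The density figure above is most naturally read as the fraction of sampled tokens on which the feature has a nonzero activation. That reading is an assumption, since the page does not define the metric. A minimal sketch under that assumption follows; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means (tokens where the feature fires) / (all tokens).
    `activations` is a flat list with one activation value per token.
    """
    if not activations:
        raise ValueError("activations must contain at least one value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, the feature fires on 5 of them.
sample = [0.0] * 1000
for i in (3, 250, 500, 777, 901):
    sample[i] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.500%
```

Under this reading, a density of 0.572% would mean the feature fires on roughly 6 of every 1000 tokens in the sampled text.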