INDEX
Explanations
mentions of people involved in negative or criminal activities
references to individuals involved in crime-related incidents
New Auto-Interp
Negative Logits
ecause
-0.75
raltar
-0.72
cale
-0.69
aeda
-0.67
olesterol
-0.62
ilaterally
-0.61
OME
-0.60
Est
-0.60
agascar
-0.59
forestation
-0.59
POSITIVE LOGITS
's
0.83
testified
0.81
thanked
0.77
pleaded
0.75
nels
0.75
surn
0.74
wore
0.74
withdrew
0.74
went
0.73
lied
0.73
Activations Density 0.261%