INDEX
Explanations
criminal activities or incidents related to violence and robbery
New Auto-Interp
Negative Logits
ire
-0.68
ocus
-0.64
Ratings
-0.64
roads
-0.61
rastructure
-0.60
Consortium
-0.59
chieve
-0.58
Month
-0.58
Investment
-0.58
ocl
-0.57
POSITIVE LOGITS
prompting
1.09
injuring
0.94
resulting
0.94
causing
0.91
intending
0.88
according
0.87
thereby
0.86
then
0.86
unaware
0.83
forcing
0.83
Activations Density 0.334%