INDEX
Explanations
phrases related to criminal activities
references to legal or criminal situations
Negative Logits
Token        Value
Supported    -0.68
oday         -0.65
SPONSORED    -0.64
MpServer     -0.62
iscons       -0.62
WAS          -0.62
BT           -0.62
arij         -0.61
HUD          -0.61
Access       -0.60
Positive Logits
Token        Value
nowadays     1.00
dictators    0.89
presidents   0.86
rarely       0.85
usually      0.81
sometimes    0.79
tends        0.79
seldom       0.79
endings      0.77
Usually      0.76
Activations Density 0.894%