INDEX
NEGATIVE LOGITS
abuses
-0.12
Parcel
-0.10
inka
-0.10
wrongdoing
-0.10
subpoena
-0.10
asaki
-0.10
taxing
-0.10
avers
-0.09
é¸
-0.09
Prostit
-0.09
POSITIVE LOGITS
charge
0.18
charges
0.16
penalties
0.16
stiff
0.15
colony
0.15
सज
0.14
conviction
0.13
charge
0.13
罪
0.13
punished
0.13
Activations Density 0.077%