INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     abuses
    -0.12
     Parcel
    -0.10
    inka
    -0.10
     wrongdoing
    -0.10
     subpoena
    -0.10
    asaki
    -0.10
     taxing
    -0.10
    avers
    -0.09
    é¸
    -0.09
    Prostit
    -0.09
    POSITIVE LOGITS
     charge
    0.18
     charges
    0.16
     penalties
    0.16
     stiff
    0.15
     colony
    0.15
     à¤¸à¤ľ
    0.14
     conviction
    0.13
    charge
    0.13
    罪
    0.13
     punished
    0.13
    Act Density 0.077%

    No Known Activations