INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     User
    -0.09
     Normally
    -0.08
     Tabelle
    -0.08
     Versorgung
    -0.08
     Originally
    -0.07
     chemically
    -0.07
     характ
    -0.07
    所谓
    -0.07
     Déf
    -0.07
     Telephone
    -0.07
    POSITIVE LOGITS
    াশ
    0.08
     clim
    0.08
     cringe
    0.08
     diligence
    0.07
    0.07
     peregr
    0.07
    /year
    0.07
    하거나
    0.07
    0.07
     cog
    0.07
    Act Density 0.003%

    No Known Activations