INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     indisponible
    -0.55
    olak
    -0.53
    uttavia
    -0.53
    zeera
    -0.52
     Konkurrenz
    -0.52
     Bühne
    -0.51
    makeConstraints
    -0.51
     bezpośred
    -0.50
     presto
    -0.49
     Locator
    -0.49
    POSITIVE LOGITS
     sins
    0.62
     crimes
    0.59
     pecado
    0.56
     pecados
    0.56
     offenses
    0.54
     crime
    0.54
     sin
    0.53
     sinned
    0.52
     offences
    0.52
     sinful
    0.52
    Act Density 0.007%

    No Known Activations