    Negative Logits
    Token               Logit
    " unity"            -0.08
    " independiente"    -0.08
    " UNITY"            -0.08
    " indip"            -0.08
    "Unity"             -0.08
    "<Unity"            -0.08
    " profil"           -0.08
    " Binance"          -0.08
                        -0.07
    " reconsider"       -0.07
    Positive Logits
    Token               Logit
    " intentionally"    0.16
    " deliberately"     0.13
    " purposely"        0.12
    " intentional"      0.12
    " volontaire"       0.11
    " malformed"        0.11
    " faulty"           0.10
    " knowingly"        0.10
    "entionally"        0.10
    " imperfections"    0.10
    Activation Density: 0.015%

    No Known Activations