INDEX
    Negative Logits
    Token           Logit
    zés             -0.07
     differing      -0.07
    minute          -0.07
    avar            -0.07
    (blank)         -0.07
     കോ             -0.07
    Op              -0.07
    (blank)         -0.07
     modulus        -0.07
    (blank)         -0.07
    Positive Logits
    Token           Logit
     bırak          0.09
     tare           0.08
     Dana           0.08
     sauté          0.08
     Married        0.08
     alang          0.08
    ,全             0.07
    adax            0.07
     gaf            0.07
     רבה            0.07
    Activation Density: 0.003%

    No Known Activations