INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     güven
    -0.06
    chemist
    -0.06
     diseases
    -0.06
    -0.06
     kır
    -0.06
    .ar
    -0.06
    chema
    -0.06
    rupt
    -0.06
    AME
    -0.06
    UFFIX
    -0.06
    POSITIVE LOGITS
     extents
    0.06
    umat
    0.06
    998
    0.06
     prix
    0.06
    0.06
     çoğu
    0.06
     М
    0.06
    997
    0.06
     integrate
    0.06
     пром
    0.06
    Act Density 0.007%

    No Known Activations