INDEX
    Explanations

    evidence and data related to samples and their characteristics

    New Auto-Interp
    Negative Logits
     فريبيس
    -0.74
     مشين
    -0.73
     ffilmiau
    -0.72
     لينك
    -0.69
     ویکی‌پدیا
    -0.69
    ล้ว
    -0.66
    Билгалдахарш
    -0.65
    ResumeLayout
    -0.64
    afone
    -0.60
    SpringRunner
    -0.60
    POSITIVE LOGITS
    uitable
    0.54
    chner
    0.44
    arto
    0.44
    bij
    0.43
    uto
    0.43
    atsen
    0.43
     encara
    0.40
    enio
    0.39
    ciuto
    0.39
     эк
    0.39
    Act Density 0.326%

    No Known Activations