INDEX
    Explanations
    Negative Logits
     buttonText     -0.07
     зуст           -0.07
     electoral      -0.07
    elop            -0.07
    tributes        -0.06
    _segment        -0.06
    ●●●●●●●●        -0.06
    _truth          -0.06
     thăm           -0.06
    ısından         -0.06
    Positive Logits
     specialized    0.07
     differentiate  0.07
     Byrne          0.07
     chefs          0.07
    CLE             0.07
    judul           0.07
    ])↵↵↵           0.06
    #$              0.06
    %-              0.06
     siblings       0.06
    Activation Density: 0.001%

    No Known Activations