INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    нож
    -0.06
     dynasty
    -0.06
    utra
    -0.06
    공지
    -0.06
     якості
    -0.06
     south
    -0.06
    fullscreen
    -0.06
    laughter
    -0.06
     Confeder
    -0.06
     initiated
    -0.06
    POSITIVE LOGITS
    keletal
    0.07
     джер
    0.06
    .clientX
    0.06
     risk
    0.06
    .tem
    0.06
    offs
    0.06
     worldly
    0.06
     Palette
    0.06
     tic
    0.06
     Marathon
    0.06
    Act Density 0.000%

    No Known Activations