INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     스포츠
    -0.07
     دقیقه
    -0.06
    ptal
    -0.06
     شیمی
    -0.06
     구성
    -0.06
     místa
    -0.06
     luxe
    -0.06
    เย
    -0.06
     Manga
    -0.06
     Gos
    -0.06
    POSITIVE LOGITS
    _DOWN
    0.07
    ,《
    0.07
    0.07
    ackbar
    0.07
    -called
    0.07
     called
    0.07
     termed
    0.06
    andbox
    0.06
     DOWN
    0.06
     Barrett
    0.06
    Act Density 0.056%

    No Known Activations