INDEX
    Explanations
    Negative Logits
     진행
    -0.06
    :?
    -0.06
     casts
    -0.06
     variety
    -0.06
     coursework
    -0.06
    Ci
    -0.06
    Constants
    -0.06
    -0.06
     ceux
    -0.06
     
    -0.06
    Positive Logits
     indemn
    0.18
     blame
    0.09
    900
    0.07
     Lennon
    0.06
    edin
    0.06
    งน
    0.06
     Haven
    0.06
     نگاه
    0.06
     Revolutionary
    0.06
    income
    0.06
    Activation Density 0.003%

    No Known Activations