INDEX
    NEGATIVE LOGITS
    ลา              -0.07
     crimes         -0.07
    -changing       -0.06
     socially       -0.06
    тон             -0.06
    encoding        -0.06
                    -0.06
    _utilities      -0.06
    ของค            -0.06
     corrections    -0.06
    POSITIVE LOGITS
     rins           0.08
     무슨            0.07
     univerz        0.07
    =======↵        0.06
    209             0.06
    ávající         0.06
     epub           0.06
    <TSource        0.06
     MyApp          0.06
     kosher         0.06
    Act Density 0.023%

    No Known Activations