INDEX
    Explanations

    but/however

    Negative Logits
    Token        Logit
    " wpis"      -0.08
    " социал"    -0.07
    " een"       -0.07
    (blank)      -0.07
    " SZ"        -0.07
    "្គ"          -0.07
    "со"         -0.07
    (blank)      -0.07
    "_GAME"      -0.07
    " social"    -0.07
    Positive Logits
    Token        Logit
    " beachten"  0.08
    " 이렇게"     0.08
    "itchie"     0.08
    (blank)      0.08
    " cave"      0.08
    " Cave"      0.08
    "------------------------------------------------------------------------------------------------"  0.08
    "preg"       0.08
    "ibbean"     0.08
    (blank)      0.08
    Activation Density: 0.066%

    No Known Activations