INDEX
    Explanations

    expressions related to states of being and living situations

    New Auto-Interp
    Negative Logits
    antu
    -0.17
    ghi
    -0.17
     elsewhere
    -0.15
    åºŃ
    -0.15
    inline
    -0.14
    inds
    -0.14
    acha
    -0.14
    ane
    -0.14
    akit
    -0.14
    OLT
    -0.14
    POSITIVE LOGITS
     ÙħÙĨÙĩ
    0.20
     ÙģÙĬÙĩ
    0.16
    å±ŀ
    0.16
    upon
    0.16
     عÙĦÙĬÙĩا
    0.16
    å¤Ħ
    0.15
    è¦
    0.15
    udden
    0.15
     upon
    0.15
    icle
    0.15
    Act Density 0.143%

    No Known Activations