INDEX
    Explanations

    references to low-income situations or contexts

    New Auto-Interp
    Negative Logits
    ookies
    -0.18
    uner
    -0.16
    ute
    -0.15
    ione
    -0.15
    pais
    -0.15
    ixels
    -0.15
    ockets
    -0.15
    ัà¸ģà¸Ķ
    -0.14
    igu
    -0.14
    αι
    -0.14
    POSITIVE LOGITS
    down
    0.27
    enstein
    0.27
     hanging
    0.26
    -key
    0.26
    -cost
    0.25
     Hanging
    0.25
    /no
    0.25
    ongan
    0.24
    liest
    0.23
    rance
    0.23
    Act Density 0.047%

    No Known Activations