INDEX
    Explanations

    references to the nation's well-being or concerns

    Negative Logits
    rud            -0.17
     Bethlehem     -0.16
    inkle          -0.16
    ocha           -0.15
    pais           -0.14
    IGIN           -0.14
    enty           -0.14
    los            -0.14
    peg            -0.14
    iker           -0.14
    Positive Logits
    ATAB           0.15
    aname          0.14
    λεκ            0.14
    isel           0.14
    esson          0.14
    azes           0.14
    URA            0.13
    оро            0.13
    HttpRequest    0.13
    setState       0.13
    Activation Density: 0.039%

    No Known Activations