INDEX
    Explanations

    geographical locations and states

    Negative Logits
    Token         Logit
    "alis"        -0.17
    "573"         -0.16
    "quoi"        -0.15
    "ysz"         -0.15
    "atak"        -0.15
    "ittel"       -0.14
    " Woodward"   -0.14
    " pros"       -0.14
    "oes"         -0.14
    "rog"         -0.14
    Positive Logits
    Token         Logit
    " USA"        0.29
    "USA"         0.25
    " usa"        0.21
    " Usa"        0.18
    "achuset"     0.18
    "orida"       0.18
    " США"        0.16
    "_US"         0.15
    "/world"      0.14
    "serter"      0.14
    Activation Density 0.110%

    No Known Activations