INDEX
    Explanations

    references to geographical locations and the concept of "American."

    New Auto-Interp
    Negative Logits
    uga
    -0.15
    CLUD
    -0.14
    ritz
    -0.14
    uju
    -0.14
    å²³
    -0.14
    ená
    -0.14
    .hwp
    -0.13
    Äįe
    -0.13
    anova
    -0.13
    å¤
    -0.13
    POSITIVE LOGITS
     U
    0.50
     United
    0.47
     Un
    0.41
     Unt
    0.39
    United
    0.38
     US
    0.36
    U
    0.34
    Un
    0.31
     united
    0.29
     UNITED
    0.28
    Act Density 0.129%

    No Known Activations