    Explanations

    country names, particularly the United States and United Kingdom

    names of countries and cities, particularly emphasizing the term "United."

    Negative Logits
    ————                -0.61
     âĻ                 -0.60
     notation           -0.60
    âĶĢâĶĢâĶĢâĶĢ        -0.56
     these              -0.55
     "<                 -0.55
     overt              -0.54
     âĸº                -0.54
     unde               -0.54
    ****************    -0.54
    Positive Logits
    foundland           0.76
    etheless            0.70
    luaj                0.62
     Hudson             0.60
    é¾įåĸļ士             0.58
    esses               0.58
    amins               0.57
     Strait             0.56
    taboola             0.55
    odium               0.55
    Activation Density: 0.415%

    No Known Activations