INDEX
    Explanations

    words relating to the United States or U.S. interactions with other countries

    references to the United States

    New Auto-Interp
    Negative Logits
     STATS
    -0.69
    */(
    -0.68
    theless
    -0.65
    regor
    -0.64
    razil
    -0.64
     Proto
    -0.63
     shave
    -0.62
    fields
    -0.62
    bler
    -0.62
    adobe
    -0.61
    POSITIVE LOGITS
    ADA
    0.92
    GI
    0.87
    ESSION
    0.86
    ierra
    0.84
     Embassy
    0.83
    eal
    0.82
    .$
    0.81
    .,
    0.80
    IDA
    0.80
    AAF
    0.79
    Act Density 0.045%

    No Known Activations