INDEX
    Explanations

    mentions of the United States

    occurrences of the abbreviation "U.S." or references to the United States

    New Auto-Interp
    Negative Logits
    theless
    -0.87
     STATS
    -0.71
     simmer
    -0.61
     unpre
    -0.60
     caution
    -0.59
    proof
    -0.56
     Cancel
    -0.55
     KP
    -0.55
     proportions
    -0.55
     organising
    -0.54
    POSITIVE LOGITS
    .,
    1.63
    .?
    1.36
    .;
    1.27
    .,"
    1.25
    .:
    1.24
    .-
    1.21
    ./
    1.19
    .—
    1.14
    .$
    1.05
    .),
    1.01
    Act Density 0.053%

    No Known Activations