INDEX
    Explanations

    proper nouns such as names of locations and organizations

    periods and punctuation marks

    Negative Logits
    "hiba"            -0.78
    "eatures"         -0.72
    " eleph"          -0.68
    "cius"            -0.67
    " advis"          -0.66
    " fulfillment"    -0.66
    " appropriate"    -0.66
    " refres"         -0.65
    "idious"          -0.63
    "rimp"            -0.62
    Positive Logits
    " Pool"           0.78
    "McC"             0.77
    "O"               0.76
    " PARK"           0.73
    " Pryor"          0.71
    "MX"              0.71
    " Mellon"         0.70
    "J"               0.70
    " Tribe"          0.68
    "orks"            0.68
    Activation Density: 0.027%

    No Known Activations