INDEX
    Explanations

    mentions of prestigious awards, accolades, or institutions

    instances of the word "prestigious."

    Negative Logits
    ghan          -0.68
    sil           -0.66
    irez          -0.65
    Twe           -0.65
    creator       -0.65
    plant         -0.64
    activated     -0.64
    uber          -0.64
    harm          -0.64
    Radio         -0.63
    Positive Logits
     accol            0.94
     awards           0.91
    cffff             0.88
     prestigious      0.85
     prizes           0.85
     honors           0.83
     coveted          0.78
     award            0.78
     prest            0.77
     endorsements     0.74
    Act Density 0.034%

    No Known Activations