INDEX
    Explanations

    proper nouns and specific names

    New Auto-Interp
    Negative Logits
    elist
    -0.49
    opol
    -0.47
    arb
    -0.46
    worldly
    -0.46
    chron
    -0.46
     Phi
    -0.44
     unpre
    -0.42
    quel
    -0.41
    mint
    -0.41
     Unt
    -0.40
    POSITIVE LOGITS
    iard
    0.66
    sburgh
    0.61
    sburg
    0.59
    espie
    0.53
    enium
    0.51
    ingham
    0.51
    bury
    0.49
    IONS
    0.48
    uminati
    0.48
    iflower
    0.47
    Act Density 7.986%

    No Known Activations