INDEX
    Explanations

    words related to proper nouns or specific names

    capital letters, specifically initials or names likely associated with titles or important terms

    New Auto-Interp
    Negative Logits
    è£ħ
    -0.62
    REDACTED
    -0.60
     DOE
    -0.59
    stellar
    -0.58
    DragonMagazine
    -0.58
    hua
    -0.57
     Bayer
    -0.57
    CBC
    -0.56
     ESV
    -0.55
    éĹĺ
    -0.54
    POSITIVE LOGITS
    ombat
    0.93
    asses
    0.88
    umps
    0.85
    ifter
    0.84
    oots
    0.83
    elt
    0.78
    igg
    0.78
    ormon
    0.78
    oot
    0.78
    eret
    0.77
    Act Density 0.187%

    No Known Activations