INDEX
    Explanations

    mentions of futuristic concepts or technologies

    punctuation that signifies the end of sentences

    New Auto-Interp
    Negative Logits
     inclusion
    -0.77
    ominated
    -0.73
     interchange
    -0.72
     imperson
    -0.72
     reb
    -0.71
    burgh
    -0.71
     involuntary
    -0.68
     systematically
    -0.68
     nonexistent
    -0.68
     representation
    -0.68
    POSITIVE LOGITS
    <|endoftext|>
    1.69
     ®
    1.43
     Hopefully
    1.40
     Until
    1.22
     Regardless
    1.16
     Stay
    1.15
     Otherwise
    1.14
     UPDATE
    1.12
     ;)
    1.09
     Unless
    1.08
    Act Density 0.477%

    No Known Activations