INDEX
    Explanations

    punctuation marks, specifically colons

    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.06
    2:0.10
    3:0.08
    4:0.09
    5:0.06
    6:0.08
    7:0.09
    8:0.08
    9:0.06
    10:0.09
    11:0.08
    Negative Logits
    seekers
    -1.90
    thumbnails
    -1.70
    ENSE
    -1.68
    "]=>
    -1.64
    tarians
    -1.64
    eers
    -1.60
    dream
    -1.58
    verse
    -1.58
    chio
    -1.53
     giveaways
    -1.53
    POSITIVE LOGITS
     mart
    1.74
    abase
    1.72
     tops
    1.65
     Ples
    1.64
     downed
    1.64
    leck
    1.61
    hower
    1.61
     Brist
    1.59
    untled
    1.59
     pear
    1.56
    Act Density 0.000%

    No Known Activations