INDEX
    Explanations

    mentions of the term "Anonymous" or username-related content

    instances of the word "Anonymous" and related terms

    New Auto-Interp
    Negative Logits
    eele
    -0.81
    =-=-=-=-=-=-=-=-
    -0.75
     Gork
    -0.74
    asters
    -0.70
    rest
    -0.70
    ++++++++++++++++
    -0.69
    efully
    -0.66
    tsky
    -0.65
    orses
    -0.64
    enegger
    -0.64
    POSITIVE LOGITS
     Anonymous
    0.81
    ica
    0.72
    Anonymous
    0.71
    onymous
    0.69
     obliged
    0.67
     hacker
    0.67
     Warfare
    0.66
    ãĥĩ
    0.65
    cott
    0.64
    uthor
    0.63
    Act Density 0.012%

    No Known Activations