INDEX
    Explanations

    references to messages

    references to messages or communications

    New Auto-Interp
    Negative Logits
    engeance
    -0.80
    aughs
    -0.74
    itals
    -0.73
    arte
    -0.69
    erenn
    -0.68
    emale
    -0.67
    ONSORED
    -0.66
    urses
    -0.66
    pmwiki
    -0.66
    rowd
    -0.66
    POSITIVE LOGITS
     messages
    0.98
     goodbye
    0.91
    board
    0.91
    box
    0.89
    message
    0.88
     message
    0.87
    boxes
    0.86
     Messages
    0.86
    FontSize
    0.84
     sent
    0.83
    Act Density 0.032%

    No Known Activations