INDEX
    Explanations

    calls to action for sharing content with others

    phrases advocating for sharing with others

    New Auto-Interp
    Negative Logits
    ischer
    -0.81
    ©¶æ¥µ
    -0.77
    itute
    -0.66
    ouf
    -0.65
     reviewed
    -0.65
    ovich
    -0.64
    hemat
    -0.64
    orah
    -0.63
    chin
    -0.62
    sylvania
    -0.61
    POSITIVE LOGITS
     regards
    1.17
     regard
    1.09
    stood
    1.00
    draw
    0.93
     impunity
    0.92
     respect
    0.88
     dignity
    0.81
     Pastebin
    0.79
     srfAttach
    0.77
     us
    0.75
    Act Density 0.130%

    No Known Activations