INDEX
    Explanations

    expressions of gratitude and positive encouragement

    New Auto-Interp
    Negative Logits
    …).
    -0.75
    ?!"
    -0.73
    IUrlHelper
    -0.71
    ?!”
    -0.69
    ...".
    -0.68
    LookAnd
    -0.68
    ...).
    -0.67
    RenderAtEndOf
    -0.63
    findpost
    -0.62
    Worse
    -0.61
    POSITIVE LOGITS
     :)
    1.21
     <
    1.15
     (:
    1.14
    :)
    1.08
     =)
    1.04
     :))
    0.97
     xoxo
    0.95
     ☺️
    0.95
     ^_^
    0.94
     :)</
    0.93
    Act Density 0.253%

    No Known Activations