INDEX
    Explanations

    conversational phrases indicating appreciation or acknowledgment

    New Auto-Interp
    Negative Logits
     in
    -0.71
    ,
    -0.64
    -0.62
     a
    -0.58
    a
    -0.57
    v
    -0.57
    b
    -0.55
    c
    -0.53
     most
    -0.53
     one
    -0.52
    POSITIVE LOGITS
    UnusedPrivate
    1.21
     Reſ
    1.20
     للاسماء
    1.19
    DeleteBehavior
    1.19
    bootstrapcdn
    1.15
     Efq
    1.15
    Personensuche
    1.11
     purpoſe
    1.09
    +#+#
    1.09
     وتسجيلات
    1.07
    Act Density 0.219%

    No Known Activations