INDEX
    Explanations

    expressions of humor or light-heartedness

    emoticons for happiness or playfulness

    New Auto-Interp
    Negative Logits
    دانشنامهٔ
    -0.83
    ArgsConstructor
    -0.82
    tagHelperRunner
    -0.76
    Ārējās
    -0.76
    "])
    
    -0.76
    ]")]
    -0.75
     يتيمه
    -0.74
    __':
    
    -0.73
    __":
    
    -0.73
     autorytatywna
    -0.71
    POSITIVE LOGITS
     :)
    0.56
    !
    0.49
     :-)
    0.41
    .
    0.39
     ;)
    0.39
    !!!
    0.38
     lol
    0.38
    :)
    0.37
     !
    0.36
     :))
    0.36
    Act Density 0.149%

    No Known Activations