INDEX
    Explanations

    emotive expressions or symbols indicating laughter and emotions

    emoticons and special characters

    New Auto-Interp
    Negative Logits
     صوتيه
    -0.76
    twimg
    -0.71
    fjspx
    -0.70
    featureID
    -0.69
    kháu
    -0.69
    OGND
    -0.64
    ロウィン
    -0.63
    ſicht
    -0.62
    tagext
    -0.62
    niſſe
    -0.60
    POSITIVE LOGITS
     mesmos
    0.38
     conquista
    0.38
    RefNanny
    0.37
    自己
    0.36
     Svensson
    0.34
    Yet
    0.34
     sağ
    0.34
    EqualsAnd
    0.32
    0.32
    Why
    0.32
    Act Density 0.010%

    No Known Activations