INDEX
    Explanations

    instances of emoticons or symbols used in informal communication

    New Auto-Interp
    Negative Logits
    skirts
    -0.07
    isci
    -0.07
    sburgh
    -0.07
    ransition
    -0.07
    uite
    -0.07
     pornofilm
    -0.07
    sburg
    -0.07
    uslim
    -0.07
    erras
    -0.07
    uire
    -0.07
    POSITIVE LOGITS
    icz
    0.07
    zu
    0.07
    eger
    0.07
    ook
    0.06
    İ
    0.06
     dope
    0.06
    oki
    0.06
    Ħ
    0.06
    amas
    0.06
     Vincent
    0.06
    Act Density 0.001%

    No Known Activations