INDEX
    Explanations

    social media platform names

    references to popular social media platforms

    New Auto-Interp
    Negative Logits
     Samar
    -0.74
    orem
    -0.69
    ersen
    -0.68
    ORGE
    -0.68
     guiName
    -0.67
    Bey
    -0.67
     Lauder
    -0.65
    annis
    -0.65
    erella
    -0.63
    izoph
    -0.62
    POSITIVE LOGITS
    facebook
    0.71
    Ĥİ
    0.68
     Sina
    0.64
     Hacker
    0.64
    Tumblr
    0.64
    grounds
    0.64
    ĨĴ
    0.63
    ®
    0.63
    Joined
    0.62
    IJ
    0.61
    Act Density 0.064%

    No Known Activations