INDEX
    Explanations

    references to Facebook and its related features, functions, or pages

    New Auto-Interp
    Negative Logits
    weise
    -0.15
    ãĥ©ãĥĥãĤ¯
    -0.15
     weblog
    -0.15
    ONO
    -0.15
    одав
    -0.14
    äd
    -0.14
    Ñħи
    -0.14
    ephir
    -0.14
    venes
    -0.14
     Associ
    -0.14
    POSITIVE LOGITS
     Messenger
    0.30
    s
    0.28
     messenger
    0.24
    /twitter
    0.23
    /T
    0.22
     groups
    0.21
    istan
    0.20
    Messenger
    0.20
     Groups
    0.20
     page
    0.20
    Act Density 0.016%

    No Known Activations