INDEX
    Explanations

    references to surveillance technology and privacy concerns

    Negative Logits
    __':↵             -0.67
    __":↵             -0.64
    AndEndTag         -0.63
    __':              -0.61
    SequentialGroup   -0.59
    __":              -0.59
    UserScript        -0.58
    httphttps         -0.56
    kháu              -0.55
     autorytatywna    -0.53
    Positive Logits
    ↵↵                0.49
    LikeLiked         0.43
     şeklinde         0.38
    skechers          0.38
    oneofs            0.37
    Abitanti          0.36
     debout           0.34
    windowFixed       0.34
     Lune             0.33
    category          0.33
    Activation Density: 0.004%

    No Known Activations