INDEX
    Explanations

    issues related to surveillance and privacy violations

    New Auto-Interp
    Negative Logits
    شهاد
    -0.69
     >=",
    -0.56
     recommandée
    -0.55
     ungrateful
    -0.54
    AddTagHelper
    -0.53
    ########.
    -0.52
    🎰
    -0.51
     disambiguazione
    -0.51
     Expédié
    -0.51
    Reentrant
    -0.51
    POSITIVE LOGITS
     privacy
    0.96
     Privacy
    0.83
    privacy
    0.82
    Privacy
    0.79
     Liberties
    0.78
     PRIVACY
    0.78
    PRIVACY
    0.75
     surveillance
    0.74
     Surveillance
    0.71
     dystop
    0.69
    Act Density 0.678%

    No Known Activations