INDEX
    Explanations

    privacy-related terms

    references to privacy and related policies

    New Auto-Interp
    Negative Logits
    xual
    -0.92
    kell
    -0.72
    cki
    -0.71
    annis
    -0.70
    Production
    -0.67
    ×Ļ×
    -0.66
    flat
    -0.66
    ings
    -0.65
    enegger
    -0.64
    urgy
    -0.62
    POSITIVE LOGITS
     protections
    0.88
     rights
    0.87
     privacy
    0.84
     Rights
    0.83
     safeguards
    0.77
    ileged
    0.73
     Liberties
    0.73
    rights
    0.73
     Preferences
    0.72
    eties
    0.71
    Act Density 0.021%

    No Known Activations