INDEX
    Explanations

    words related to privacy or data protection

    New Auto-Interp
    Negative Logits
    xual
    -0.82
    kell
    -0.72
    cki
    -0.70
    annis
    -0.67
    mount
    -0.63
    ithing
    -0.63
    ensen
    -0.63
    ×Ļ×
    -0.63
    iatus
    -0.61
    flat
    -0.61
    POSITIVE LOGITS
     privacy
    0.89
     Rights
    0.86
     rights
    0.86
     protections
    0.84
     Liberties
    0.78
     safeguards
    0.75
     liberties
    0.75
     Preferences
    0.74
    rights
    0.74
    policy
    0.73
    Act Density 0.021%

    No Known Activations