INDEX
Explanations
words related to privacy and data protection
New Auto-Interp
Negative Logits
xual
-0.71
kell
-0.71
cki
-0.66
Production
-0.66
mount
-0.64
stad
-0.64
enson
-0.64
ithing
-0.64
nant
-0.64
lift
-0.63
POSITIVE LOGITS
protections
1.00
rights
0.99
Rights
0.93
safeguards
0.88
privacy
0.87
rights
0.85
liberties
0.83
policy
0.80
Liberties
0.80
freedoms
0.79
Activations Density 0.024%