INDEX
Explanations
language related to privacy and data protection
terms related to privacy and data protection
New Auto-Interp
Negative Logits
xual
-0.80
annis
-0.71
cki
-0.67
kell
-0.67
Production
-0.66
hani
-0.64
iatus
-0.63
mount
-0.63
eryl
-0.62
ensen
-0.61
POSITIVE LOGITS
rights
0.96
protections
0.93
Rights
0.91
privacy
0.90
liberties
0.81
safeguards
0.80
Liberties
0.77
freedoms
0.77
rights
0.75
Privacy
0.75
Activations Density 0.025%