Explanations
references to surveillance technology and facial recognition systems
Negative Logits
gnore         -0.15
vandal        -0.14
_SUPPORTED    -0.14
ocide         -0.14
Wheels        -0.13
loff          -0.13
zar           -0.13
_HT           -0.13
/layouts      -0.13
sein          -0.13
Positive Logits
surveillance  0.39
privacy       0.36
Surveillance  0.32
Privacy       0.31
surve         0.29
privacy       0.28
Privacy       0.28
sno           0.27
tracking      0.27
spy           0.26
Activations Density 0.204%