Explanations
terms related to espionage and surveillance
Negative Logits
stk            -0.18
izer           -0.16
imers          -0.16
stime          -0.15
ioned          -0.15
bred           -0.15
иÑĤÑĥ           -0.14
quan           -0.14
ucha           -0.14
edException    -0.14
Positive Logits
ware           0.23
glass          0.22
der            0.21
satellites     0.21
bubble         0.19
ros            0.18
-fi            0.18
ionage         0.18
.spy           0.18
satellite      0.17
Activations Density: 0.010%