INDEX
Explanations
terms related to espionage and spying
terms related to espionage and surveillance activities
Negative Logits
xual        -0.81
esville     -0.73
à¼          -0.71
ĸļ          -0.70
Kurd        -0.68
jiang       -0.67
Centauri    -0.66
Explain     -0.66
ŃĶ          -0.64
clus        -0.64
Positive Logits
glass       0.90
oleon       0.90
sonian      0.88
OSH         0.84
spoof       0.77
plane       0.75
spying      0.75
spies       0.70
ionage      0.70
HQ          0.70
Activations Density 0.027%