INDEX
Explanations
references to espionage or surveillance activities
references to espionage and spying activities
Negative Logits
Token      Logit
xual       -0.77
ŃĶ         -0.73
opathic    -0.70
Explain    -0.68
esville    -0.65
usable     -0.65
bal        -0.64
Course     -0.63
oses       -0.63
nick       -0.63
Positive Logits
Token      Logit
ionage     0.93
sonian     0.88
spying     0.86
glass      0.81
dropping   0.80
doms       0.79
abroad     0.78
spoof      0.71
OSH        0.71
afia       0.71
Activations Density: 0.021%
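
The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation is above zero, might look like the following. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of the same magnitude.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" means the share of tokens on which the
    feature fires at all; the source page does not state this.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic illustration only: 21 active tokens out of 100,000.
sample = [0.0] * 99_979 + [1.5] * 21
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.021% would correspond to the feature firing on roughly 2 tokens in every 10,000.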