INDEX
Explanations
phrases related to spy activities and spy software
descriptions of spy-related products and their features
Negative Logits
Slate            -0.85
"—               -0.83
Ramsay           -0.81
Advertisement    -0.80
—"               -0.80
Blake            -0.79
Burke            -0.78
Scott            -0.78
Whedon           -0.78
Jackson          -0.77
Positive Logits
resear           1.09
´                1.08
till             1.04
english          1.02
happ             0.97
analyse          0.97
physic           0.95
alot             0.92
learnt           0.89
!!               0.89
Activations Density: 1.990%