INDEX
Explanations
words related to military operations and strategy
references to specific names, titles, or concepts associated with intelligence and secrecy
New Auto-Interp
Negative Logits
HM
-0.77
ower
-0.76
ulton
-0.75
stru
-0.74
user
-0.74
udeb
-0.71
alogue
-0.71
regn
-0.70
rylic
-0.69
USER
-0.69
POSITIVE LOGITS
atility
0.85
Gork
0.81
Emanuel
0.72
Emin
0.70
icated
0.68
giving
0.67
mole
0.67
reon
0.66
llah
0.65
andr
0.64
Activations Density 0.020%