INDEX
Explanations
classified documents and sensitive information being shared or leaked
Negative Logits
Interstitial    -0.94
culosis         -0.82
stad            -0.74
ntil            -0.69
iasis           -0.69
Õ               -0.68
cers            -0.64
axis            -0.63
icidal          -0.62
adium           -0.62
POSITIVE LOGITS
pertaining      0.92
heet            0.88
detailing       0.87
documents       0.85
relating        0.83
leaked          0.82
trove           0.81
declass         0.78
hig             0.78
dumps           0.77
Activations Density 11.394%