INDEX
Explanations
references to confidential information
terms and phrases related to confidentiality and sensitive information
New Auto-Interp
Negative Logits
̶
-0.90
alis
-0.88
annis
-0.87
plex
-0.77
owitz
-0.75
Mania
-0.73
imus
-0.73
ixels
-0.71
okemon
-0.70
akeru
-0.70
POSITIVE LOGITS
confidential
1.17
informants
1.03
informant
1.03
idential
0.97
correspondence
0.84
documents
0.82
confidentiality
0.82
information
0.80
arial
0.80
arrangements
0.79
Activations Density 0.017%