Explanations
phrases related to confidentiality and private information
mentions of confidentiality and related terms
Negative Logits
annis        -0.95
ð            -0.79
̶            -0.78
Archdemon    -0.71
iph          -0.70
uph          -0.70
apple        -0.68
akings       -0.68
ixels        -0.68
udeb         -0.67
Positive Logits
confidential       1.44
informant          1.20
informants         1.10
idential           1.08
confidentiality    1.03
correspondence     0.86
documents          0.84
briefings          0.83
transmissions      0.81
memos              0.81
Activations Density: 0.005%