Explanations
leaked or confidential information
Negative Logits
tics        -0.98
apo         -0.94
eeks        -0.87
anship      -0.85
ertodd      -0.85
aido        -0.82
rats        -0.82
abiding     -0.81
addafi      -0.81
cellence    -0.80
Positive Logits
version           1.11
correspondence    1.11
memorandum        1.10
documents         1.06
document          1.04
statement         1.04
memo              1.01
questionnaire     1.00
letter            0.99
transcript        0.97
Activations Density: 0.137%