INDEX
Explanations
references to confidential information or secrets
the word "Secret" in various contexts related to secrecy or confidential matters
New Auto-Interp
Negative Logits
grap
-0.76
Ö
-0.72
fits
-0.71
displacement
-0.69
BCE
-0.69
displ
-0.69
ettle
-0.67
bom
-0.66
orth
-0.66
tackle
-0.63
POSITIVE LOGITS
Secret
4.00
Secret
3.11
secret
2.27
secret
2.09
Secrets
2.00
secrets
1.47
Mysterious
1.32
Hidden
1.23
Postal
1.20
KGB
1.19
Activations Density 0.015%