INDEX
Explanations
words related to hidden information or confidential matters
phrases and words related to secrets
New Auto-Interp
Negative Logits
orer
-0.67
MLA
-0.64
urch
-0.64
ariat
-0.63
phe
-0.62
Huss
-0.62
CBO
-0.62
unicip
-0.61
Paw
-0.61
Nationwide
-0.60
POSITIVE LOGITS
secrets
4.11
Secrets
2.57
secret
2.06
mysteries
1.96
secrecy
1.65
secret
1.63
Secret
1.55
treasures
1.52
truths
1.50
surprises
1.40
Activations Density 0.017%