INDEX
Explanations
mention of secrets that have been discovered or are hidden
concepts related to secrets and discoveries
New Auto-Interp
Negative Logits
udeb
-0.98
ĪĴ
-0.81
erate
-0.74
phasis
-0.73
reporting
-0.70
wk
-0.68
Agg
-0.67
Zup
-0.67
bernatorial
-0.67
istrates
-0.67
POSITIVE LOGITS
secrets
1.91
mysteries
1.77
hidden
1.51
secret
1.47
treasures
1.45
treasure
1.33
mysterious
1.33
mystery
1.32
truths
1.30
unexpl
1.27
Activations Density 0.551%