INDEX
Explanations
details and uncovering activities related to investigations, uncovering hidden information, and revealing secrets
phrases related to uncovering secrets and investigative journalism
New Auto-Interp
Negative Logits
gain
-0.71
amph
-0.70
Reduce
-0.69
endment
-0.68
equal
-0.67
neutral
-0.67
accompanied
-0.66
meal
-0.65
weight
-0.64
ĪĴ
-0.63
POSITIVE LOGITS
secrets
1.79
workings
1.44
secret
1.37
mysteries
1.34
hidden
1.32
truth
1.26
truths
1.25
depths
1.18
backstory
1.16
untold
1.14
Activations Density 0.433%