INDEX
Explanations
phrases related to truth-seeking and the act of revealing or exposing hidden information
New Auto-Interp
Negative Logits
èIJ¬
-0.16
pty
-0.15
UTF
-0.14
Toolkit
-0.13
ARAM
-0.13
.lp
-0.13
scp
-0.13
ipar
-0.12
eyer
-0.12
ordo
-0.12
POSITIVE LOGITS
truth
0.75
truth
0.63
Truth
0.62
Truth
0.57
truths
0.55
verdad
0.51
_truth
0.47
reality
0.44
.truth
0.42
TR
0.40
Activations Density 0.118%