INDEX
Explanations
phrases and sentences containing the word "Truth"
New Auto-Interp
Negative Logits
igan
-0.72
sprint
-0.67
minor
-0.67
pockets
-0.65
ipop
-0.65
slot
-0.64
inarily
-0.63
*/
-0.62
registrations
-0.62
interval
-0.61
POSITIVE LOGITS
Truth
3.88
Truth
2.91
truth
2.47
truth
1.87
truths
1.73
Reality
1.49
Honest
1.28
Honest
1.25
Facts
1.25
falsehood
1.17
Activations Density 0.012%