INDEX
Explanations
discussions about honesty and truthfulness in various contexts
honesty and truth
stating the truth
New Auto-Interp
Negative Logits
tagHelper
-0.80
Paglinawan
-0.71
expandindo
-0.71
#+#
-0.70
Kaynakça
-0.68
Италијани
-0.68
complexContent
-0.66
adaptiveStyles
-0.62
:+:
-0.62
vscode
-0.62
POSITIVE LOGITS
truth
2.07
honesty
1.95
honest
1.89
truth
1.79
truthful
1.71
Truth
1.71
Truth
1.69
TRUTH
1.67
honest
1.60
truths
1.54
Activations Density 0.388%