INDEX
Explanations
references to truth and its various interpretations
New Auto-Interp
Negative Logits
ing
-0.66
o
-0.65
addPreferredGap
-0.63
ené
-0.58
ená
-0.58
pdev
-0.57
anjo
-0.57
dyn
-0.56
appName
-0.56
Redmond
-0.55
POSITIVE LOGITS
Truth
1.37
TRUTH
1.25
Truth
1.23
truth
1.22
truth
1.16
truths
1.15
Truths
1.13
fulness
0.91
Wahrheit
0.90
Tahu
0.89
Activations Density 0.005%