INDEX
Explanations
references to the concept of truth and its implications in various contexts
New Auto-Interp
Negative Logits
nonatomic
-0.97
Berikut
-0.86
AppMethodBeat
-0.84
gero
-0.83
cory
-0.81
Aras
-0.80
Garvey
-0.79
nahilalakip
-0.79
Gentry
-0.76
cenary
-0.75
POSITIVE LOGITS
truth
0.82
lessly
0.77
esserung
0.75
lie
0.74
Truths
0.74
Truth
0.74
chalk
0.73
truths
0.72
lies
0.72
truth
0.71
Activations Density 0.102%