Explanations
mentions of confidence and trust in various contexts
expressions of certainty or trust in various subjects
Negative Logits
bard      -0.73
artist    -0.70
perse     -0.69
sites     -0.69
pmwiki    -0.66
cold      -0.66
Kin       -0.65
hid       -0.64
bye       -0.63
cise      -0.63
Positive Logits
worthiness    1.19
intervals     1.08
ially         0.93
interval      0.80
assurance     0.79
lessly        0.77
confidence    0.77
worthy        0.76
knowing       0.73
fulness       0.73
Activations Density: 0.055%