INDEX
Explanations
phrases indicating certainty or confidence
expressions of certainty or lack of doubt
New Auto-Interp
Negative Logits
eller
-0.85
ellery
-0.73
emetery
-0.72
ebra
-0.72
uterte
-0.71
cler
-0.70
ells
-0.69
holder
-0.69
oiler
-0.69
ocket
-0.68
POSITIVE LOGITS
whatsoever
0.89
coerced
0.80
tempted
0.76
rightly
0.73
spurred
0.71
provoked
0.71
prompted
0.70
exagger
0.69
underest
0.69
msec
0.68
Activations Density 0.014%