INDEX
Explanations
words indicating doubt or uncertainty
words that indicate uncertainty or incompleteness
New Auto-Interp
Negative Logits
Emin
-0.83
Cosponsors
-0.82
çļ
-0.71
agents
-0.71
rather
-0.68
Might
-0.68
imoto
-0.64
UGH
-0.64
ILY
-0.63
ð
-0.62
POSITIVE LOGITS
anymore
0.80
satisfactory
0.80
nor
0.79
accurate
0.74
sure
0.73
satisfied
0.71
convincing
0.69
eworthy
0.69
icable
0.68
flawless
0.67
Activations Density 0.051%