INDEX
Explanations
expressions of uncertainty or subjectivity about situations
New Auto-Interp
Negative Logits
ests
-0.82
rouse
-0.78
orem
-0.75
otos
-0.75
andise
-0.74
ilts
-0.71
pez
-0.69
izons
-0.68
perature
-0.68
venge
-0.68
POSITIVE LOGITS
unlikely
0.93
doubtful
0.91
probable
0.78
unclear
0.75
prudent
0.72
plausible
0.71
advisable
0.70
folly
0.70
feasible
0.70
imperative
0.69
Activations Density 0.035%