Explanations
phrases related to decision-making
statements about subjective opinions or evaluations
Negative Logits
Token     Logit
mares     -0.86
ciating   -0.74
76561     -0.73
ãĤ©       -0.72
details   -0.71
azon      -0.69
igs       -0.69
arthed    -0.68
igg       -0.65
ffe       -0.64
Positive Logits
Token          Logit
appropriate    1.49
prudent        1.44
advisable      1.42
worthwhile     1.39
necessary      1.34
preferable     1.31
desirable      1.28
appropriate    1.26
acceptable     1.25
advantageous   1.25
Activations Density: 0.318%