INDEX
Explanations
phrases related to decision-making and evaluative judgments
New Auto-Interp
Negative Logits
hjäl
-0.61
Reſ
-0.61
ſeveral
-0.60
auroit
-0.57
igjen
-0.54
cammin
-0.53
Eſ
-0.52
avoient
-0.52
étoient
-0.52
Reverso
-0.52
POSITIVE LOGITS
}{*}{0.73
preferring
0.57
})`
0.54
говорю
0.53
prefer
0.52
recommend
0.51
consider
0.49
Архівовано
0.49
gerne
0.49
hesitate
0.46
Activations Density 0.468%