INDEX
Explanations
conditional phrases expressing desires or recommendations
New Auto-Interp
Negative Logits
Ennis
-0.89
Adair
-0.80
swans
-0.77
Lipa
-0.74
Freitas
-0.73
Schwan
-0.71
noun
-0.71
cephala
-0.70
Reina
-0.70
Ciro
-0.70
POSITIVE LOGITS
would
1.96
Would
1.82
WOULD
1.78
Would
1.76
would
1.70
ULD
1.17
could
1.12
ould
1.10
wouldn
1.04
würde
1.04
Activations Density 0.237%