Explanations
phrases indicating a comparison between two options
comparative phrases that emphasize distinctions or alternatives
Negative Logits
awar     -0.82
ug       -0.77
itiz     -0.76
enium    -0.75
ns       -0.72
hr       -0.72
eg       -0.72
iatus    -0.72
wm       -0.68
tan      -0.68
Positive Logits
preferably       0.82
assuming         0.70
apologies        0.70
alternatively    0.70
optionally       0.69
evidenced        0.68
Ͻ                0.66
perhaps          0.65
whichever        0.65
allowances       0.65
Activations Density: 0.286%