INDEX
Explanations
comparisons between different things or situations
comparative phrases indicating preference or alternatives
New Auto-Interp
Negative Logits
oret
-0.61
understatement
-0.58
itudinal
-0.57
hack
-0.57
Rah
-0.57
onom
-0.57
ãĤ®
-0.56
actic
-0.56
Pet
-0.54
mosp
-0.54
POSITIVE LOGITS
thereby
0.72
respectively
0.68
hammad
0.66
disadvant
0.64
soever
0.59
favour
0.58
Thousand
0.57
undet
0.57
anywhere
0.57
İĭ
0.57
Activations Density 0.910%