Explanations
comparisons using the word "than."
comparative expressions emphasizing "more than" or "less than" comparisons
Negative Logits
Token     Logit
umbn      -0.83
estern    -0.78
illary    -0.76
onomy     -0.70
auri      -0.70
esm       -0.67
ero       -0.65
uto       -0.65
ango      -0.64
eni       -0.63
Positive Logits
Token        Logit
anything     1.24
any          1.05
anyone       0.94
anybody      0.93
necessarily  0.85
ever         0.81
usual        0.81
anywhere     0.80
anymore      0.69
vice         0.69
Activations Density 0.083%
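
The density figure is easier to read with a small worked example. The sketch below assumes that "activations density" means the fraction of token positions on which the feature's activation is above zero. This is a common convention, but the source does not confirm it. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: density counts strictly positive activations; the
    dashboard's exact definition is not stated in the source.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 12,000 token positions, 10 of which activate the feature.
sample = [0.0] * 11990 + [1.5] * 10
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.083%
```

Under this reading, a density of 0.083% means the feature fires on roughly 1 in 1,200 tokens.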