INDEX
Explanations
phrases related to comparison and evaluation
phrases emphasizing a quantity or amount
Negative Logits
months      -0.62
livest      -0.61
TOP         -0.61
ynes        -0.60
cleaners    -0.60
overe       -0.59
alez        -0.58
withd       -0.58
etts        -0.56
fters       -0.56
Positive Logits
course      0.82
erous       0.80
them        0.75
course      0.74
yours       0.69
ours        0.69
theirs      0.66
Sense       0.66
us          0.65
those       0.64
Activations Density 0.061%