INDEX
Explanations
comparative phrases indicating a multiple increase or decrease
quantitative comparisons involving the word "times."
New Auto-Interp
Negative Logits
Reviewer
-1.05
aceous
-0.84
ogy
-0.78
CHAT
-0.78
arty
-0.77
irt
-0.76
liction
-0.73
OST
-0.70
XT
-0.69
RAW
-0.69
POSITIVE LOGITS
cale
0.98
consecut
0.83
paces
0.75
gestation
0.75
pan
0.72
cens
0.69
bp
0.69
hops
0.67
poons
0.66
cus
0.64
Activations Density 0.023%