INDEX
Explanations
quantitative comparisons using the word "times"
phrases indicating multiplicative comparisons or ratios
Negative Logits
cember                            -0.78
services                          -0.69
################################  -0.64
rals                              -0.64
iku                               -0.62
apolis                            -0.61
LCS                               -0.60
uctions                           -0.60
liction                           -0.59
bies                              -0.59
Positive Logits
slower    0.89
stronger  0.88
greater   0.88
faster    0.87
louder    0.84
avier     0.84
cheaper   0.83
hotter    0.83
worse     0.81
heavier   0.80
Activations Density 0.041%