INDEX
Explanations
quantitative expressions indicating a portion or fraction, such as "half of"
the phrase "more than half" or variations of it
New Auto-Interp
Negative Logits
ocr
-0.65
ffen
-0.62
berus
-0.55
igslist
-0.55
andr
-0.54
arrang
-0.52
spir
-0.52
orge
-0.51
hran
-0.51
kson
-0.51
POSITIVE LOGITS
azo
0.65
of
0.64
century
0.64
dozen
0.64
wheel
0.62
rene
0.61
century
0.61
terness
0.61
atos
0.60
million
0.59
Activations Density 0.052%