INDEX
Explanations
quantifiers or comparative phrases indicating an increase or emphasis on quantity or degree
New Auto-Interp
Negative Logits
enta
-0.18
ewise
-0.15
.TableName
-0.15
ilis
-0.14
_nat
-0.14
worse
-0.14
опаÑģ
-0.14
anim
-0.14
enet
-0.14
uled
-0.13
POSITIVE LOGITS
recently
0.17
overt
0.16
recent
0.15
moderate
0.15
UNT
0.15
traditional
0.15
velle
0.15
usual
0.14
ey
0.14
è¡£
0.14
Activations Density 0.069%