INDEX
Explanations
quantifiers and expressions of degree to indicate intensity or extent
New Auto-Interp
Negative Logits
467
-0.17
ator
-0.16
cko
-0.15
teri
-0.15
kir
-0.14
deny
-0.14
icit
-0.14
ارÙĬØ©
-0.13
baugh
-0.13
velte
-0.13
POSITIVE LOGITS
olt
0.17
fold
0.17
ingly
0.17
extent
0.16
awks
0.16
تز
0.16
Least
0.15
rost
0.15
.gs
0.14
during
0.14
Activations Density 0.102%