INDEX
Explanations
quantifiers indicating frequency or quantity in discussions
New Auto-Interp
Negative Logits
ieres
-0.17
ones
-0.16
ocz
-0.14
ër
-0.14
occasion
-0.14
ibus
-0.14
chner
-0.14
agne
-0.14
UnderTest
-0.14
Ïģιν
-0.13
POSITIVE LOGITS
of
0.17
observers
0.16
erot
0.16
readers
0.15
have
0.15
/m
0.15
such
0.15
experts
0.14
are
0.14
existing
0.14
Activations Density 0.103%