INDEX
Explanations
definite articles and their occurrences in context
New Auto-Interp
Negative Logits
opportunity
-0.18
likes
-0.17
entire
-0.16
stuff
-0.15
atts
-0.15
avenue
-0.15
likeness
-0.15
人æīį
-0.15
äºĪ
-0.14
Entire
-0.14
POSITIVE LOGITS
few
0.44
few
0.36
many
0.33
Few
0.33
Few
0.31
many
0.30
åĩłä¸ª
0.27
several
0.25
rare
0.24
MANY
0.24
Activations Density 0.117%