INDEX
Explanations
prepositions indicating focus or direction
Negative Logits
bh          -0.16
PT          -0.15
ãĤīãģı      -0.15
åł          -0.15
åĻ          -0.15
HEMA        -0.15
anzi        -0.14
ÙĪÛĮزÛĮ     -0.14
_OCCURRED   -0.14
ãĥ¼ãĥ©      -0.14
Positive Logits
Bene        0.14
phys        0.14
/fixtures   0.14
focus       0.14
circum      0.14
the         0.14
iza         0.14
lix         0.14
.tex        0.14
focus       0.14
Activations Density 0.025%