INDEX
Explanations
instances of directional and locational phrases
New Auto-Interp
Negative Logits
ssel
-0.16
zin
-0.15
ssue
-0.14
uem
-0.14
ushima
-0.14
utin
-0.14
encil
-0.14
sounding
-0.14
adena
-0.13
TD
-0.13
POSITIVE LOGITS
aus
0.17
pez
0.15
\Context
0.15
666
0.15
rame
0.15
acha
0.14
399
0.14
Wonderland
0.14
PROCUREMENT
0.14
imenti
0.14
Activations Density 0.031%