INDEX
Explanations
spatial relationships and locations within a descriptive context
Negative Logits
-has    -0.17
hadn    -0.15
\Has    -0.14
HAS     -0.14
(has    -0.14
’Ñıз    -0.14
Didn    -0.14
didn    -0.14
.Has    -0.13
_HAS    -0.13
Positive Logits
are      0.42
的是     0.33
is       0.29
there    0.29
were     0.25
estão    0.24
lies     0.23
_are     0.23
هستند    0.22
theres   0.22
Activations Density 0.186%
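
The density figure above is read here as the fraction of sampled token positions on which the feature fires. That reading is an assumption, since the page does not define the term. A minimal sketch under that assumption follows; the function name `activation_density`, the zero threshold, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all positions).
    """
    total = len(activations)
    if total == 0:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / total


# Hypothetical sample: 100,000 token positions, 186 of them active.
sample = [0.0] * 99_814 + [1.0] * 186
density = activation_density(sample)
print(f"{density:.3%}")
```

With a different threshold or a different token sample, the same feature would report a different density, so the figure is only comparable across features measured the same way.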