INDEX
Explanations
instances of the word "there" and its variations in context
Negative Logits
s        -0.16
ίγ       -0.16
openh    -0.15
_REV     -0.15
Consort  -0.14
اظ       -0.14
emin     -0.13
756      -0.13
Sug      -0.13
tright   -0.13
Positive Logits
را       0.18
zelf     0.17
-Origin  0.16
erval    0.15
Wunused  0.15
self     0.14
aly      0.14
urch     0.14
own      0.14
elf      0.14
Activations Density: 0.190%