INDEX
Explanations
the pronoun "it" in various contexts
New Auto-Interp
Negative Logits
оно
-0.18
Its
-0.18
Its
-0.17
its
-0.17
dt
-0.15
its
-0.14
,readonly
-0.13
orado
-0.13
ilan
-0.13
ingleton
-0.13
POSITIVE LOGITS
ty
0.18
rain
0.17
chy
0.17
takes
0.17
avid
0.17
cono
0.17
Takes
0.16
cul
0.16
snow
0.16
Cul
0.16
Activations Density 0.156%