INDEX
Explanations
the pronoun "it" along with its contexts and variations
Negative Logits
atron   -0.15
SHE     -0.15
Scho    -0.14
Thor    -0.14
acon    -0.14
plat    -0.14
Nob     -0.13
obble   -0.13
hood    -0.13
VIP     -0.13
Positive Logits
inan    0.16
culus   0.15
izzo    0.15
ạy      0.15
hend    0.15
idlo    0.14
.sul    0.14
_forum  0.14
ntax    0.14
箱      0.14
Activations Density 0.361%