INDEX
Explanations
instances of the pronoun "I"
New Auto-Interp
Negative Logits
itorio
-0.16
меÑĪ
-0.15
کارÛĮ
-0.14
BaseUrl
-0.14
ergy
-0.14
èĵ
-0.14
cheid
-0.14
arring
-0.14
utta
-0.14
_interval
-0.14
POSITIVE LOGITS
pu
0.15
esco
0.15
cri
0.15
ffi
0.14
Ñįлек
0.14
stit
0.14
AS
0.14
elts
0.13
ains
0.13
pot
0.13
Activations Density 0.027%