INDEX
Explanations
instances of first-person singular pronouns and their variations in text
Negative Logits
zcze        -0.20
ерж         -0.15
dropdown    -0.15
IMIT        -0.14
ecz         -0.14
orus        -0.14
lesia       -0.14
Nimbus      -0.14
imit        -0.13
reon        -0.13
Positive Logits
want        0.41
wants       0.40
wanted      0.33
wanting     0.32
want        0.31
Want        0.30
Wants       0.30
muốn        0.30
Want        0.28
要          0.27
Activations Density 0.175%