Explanations
possessive pronouns related to individuals and their belongings
Negative Logits
vc          -0.18
xffffff     -0.16
ising       -0.15
ось         -0.15
egend       -0.14
пр          -0.14
itself      -0.14
emi         -0.14
Crescent    -0.14
iale        -0.13
Positive Logits
bage        0.18
rtle        0.16
286         0.15
Rolled      0.14
opr         0.14
rolls       0.14
odem        0.14
rors        0.14
نج          0.13
elik        0.13
Activations Density 0.521%