INDEX
Explanations
phrases related to personal ownership or possession
Negative Logits
ington        -0.16
yourselves    -0.15
inev          -0.15
unanim        -0.15
wise          -0.14
mi            -0.14
ãģ¨ãĤĤ        -0.13
Ë             -0.13
sten          -0.13
ÙĨÚ¯          -0.13
Positive Logits
own           0.42
rtle          0.33
SELF          0.30
/her          0.28
opia          0.27
own           0.27
myself        0.26
/my           0.26
anmar         0.26
riad          0.26
Activations Density 0.119%
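
As a rough illustration of how a density figure like the one above could be read, the sketch below assumes it is the fraction of sampled token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; none of them come from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumed definition: density = (# active token positions) / (# positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 12 of which activate the feature.
sample = [0.0] * 9988 + [1.5] * 12
density = activation_density(sample)
print(f"Activation density: {density:.3%}")
```

Under this reading, a density of 0.119% would mean the feature is active on roughly 1 in every 840 sampled tokens.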