INDEX
Explanations
words related to various forms of possession and ownership
Negative Logits
å¹  -0.17
pering  -0.14
lier  -0.14
Props  -0.14
Discipline  -0.14
ieres  -0.13
itta  -0.13
Hal  -0.13
ãģ  -0.13
Injection  -0.13
Positive Logits
cki  0.16
coe  0.16
ãĤ¤ãĥĦ  0.15
ниÑĩ  0.15
dsa  0.15
jsc  0.14
деÑĢ  0.14
zes  0.14
öh  0.14
ihn  0.14
Activations Density: 0.255%