INDEX
Explanations
phrases related to ownership or possession
Negative Logits
Token    Logit
inia     -0.16
arts     -0.15
olf      -0.15
SKI      -0.14
igli     -0.14
нÑĮо     -0.14
spb      -0.14
oglob    -0.14
âĵĺ      -0.13
ÃŁe      -0.13
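Several token strings in the list above (for example `ÃŁe` and `âĵĺ`) look like byte-level BPE display forms rather than readable text. The sketch below assumes, without confirmation from this page, that the tokenizer uses the GPT-2 style byte-to-unicode table; under that assumption it maps each display character back to its byte and decodes the result as UTF-8. The function names are illustrative and not part of any tool shown here.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Non-printable bytes are shifted to code points starting at 256.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


def decode_token(tok):
    """Turn a byte-level display string back into text (hypothetical helper)."""
    inverse = {ch: b for b, ch in bytes_to_unicode().items()}
    raw = bytearray()
    for ch in tok:
        if ch in inverse:
            raw.append(inverse[ch])
        else:
            # Characters outside the table are assumed to be already decoded.
            raw.extend(ch.encode("utf-8"))
    return raw.decode("utf-8", errors="replace")


for tok in ["ÃŁe", "âĵĺ", "arts"]:
    print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as `arts` pass through unchanged, so the helper can be applied to every row. If the model behind this page uses a different tokenizer, the mapping does not apply.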
Positive Logits
Token        Logit
asco         0.16
Bout         0.15
azer         0.14
Hum          0.14
Hum          0.14
ahas         0.14
induction    0.13
WXYZ         0.13
/is          0.13
ynes         0.13
Activations Density 0.027%
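As a rough illustration of the density figure, the sketch below assumes that activation density means the fraction of token positions on which the feature has a nonzero activation. That definition is an assumption, not something stated on this page, and the activation values used are made up for the example.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of positions whose activation exceeds `threshold` (assumed definition)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical activations over eight token positions; one position fires.
example = [0.0, 0.0, 0.0, 1.7, 0.0, 0.0, 0.0, 0.0]
print(f"{activation_density(example):.3%}")

# Under the same assumption, a density of 0.027% corresponds to
# roughly 27 active positions per 100,000 tokens.
print(round(0.027 / 100 * 100_000))
```

Read this way, the feature is sparse: it would fire on only a few dozen tokens out of every hundred thousand.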