INDEX
Explanations
phrases indicating personal ownership or possession
New Auto-Interp
Negative Logits
mour
-0.13
-(
-0.12
Å
-0.12
ìłķìĿĦ
-0.12
æĢ§çļĦ
-0.12
beck
-0.12
好çļĦ
-0.12
elman
-0.12
_MIX
-0.12
ëĭ¥
-0.12
POSITIVE LOGITS
venida
0.14
.Word
0.14
à¤ĩन
0.13
consc
0.13
ereo
0.12
alsy
0.12
é±
0.12
ove
0.12
633
0.11
opor
0.11
Activations Density 0.387%