INDEX
Explanations
possessive pronouns indicating ownership or personal connection
New Auto-Interp
Negative Logits
aca
-0.15
Gap
-0.15
Claus
-0.15
etz
-0.15
069
-0.14
odelist
-0.14
aroo
-0.14
اÙĩÙħ
-0.14
indsight
-0.14
.addColumn
-0.14
POSITIVE LOGITS
opsy
0.17
choice
0.16
ticket
0.15
sole
0.14
omorphic
0.14
ãĥªãĥ¼ãĤº
0.14
own
0.14
WISE
0.14
spit
0.13
lie
0.13
Activations Density 0.053%