INDEX
Explanations
instances of the word "one" and its variations in context, indicating a focus on individual experiences or statements
New Auto-Interp
Negative Logits
ä¸ĺ
-0.17
å´İ
-0.16
abbo
-0.15
киÑĪ
-0.15
pite
-0.15
олоÑģ
-0.14
.nlm
-0.14
ãģİ
-0.14
OTE
-0.14
)application
-0.13
POSITIVE LOGITS
else
0.26
except
0.19
except
0.18
ever
0.18
Nobody
0.17
aten
0.16
else
0.15
_else
0.15
/no
0.15
else
0.15
Activations Density 0.044%