INDEX
Explanations
expressions related to altruism and assistance
New Auto-Interp
Negative Logits
ounty
-0.17
toa
-0.17
ounce
-0.15
cuer
-0.15
ymous
-0.15
Heller
-0.15
erken
-0.15
Çİ
-0.14
اÙĦÙħغ
-0.14
mktime
-0.14
POSITIVE LOGITS
desk
0.15
undefined
0.15
Chern
0.14
Others
0.14
Others
0.14
ESC
0.14
662
0.14
others
0.14
ana
0.14
622
0.14
Activations Density 0.046%